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(54) Titie: METHODS AND APPARATUS FOR DETERMINATION OF LENGTH POLYMORPHISMS IN DNA 
(57) Abstract 

Methods and apparatus are provided for the analysis and determination of the nature of repeat units in a genetic target. In one method 
of this invention, the nature of the repeat units in the genetic target is determined by the steps of providmg a plurality of hybridization 
complex assays arrayed on a plurality of test sites, where the hybridization complex assay includes at least a nucleic acid target containing 
a simple repetitive DNA sequence, a capture probe having a first unique flanking sequence and n repeat units, where n = 0. 1, 2..., or 
fractions thereof, being complementary to the target sequence, and a reporter probe having a selected sequence complementary to the same 
target sequence strand wherein the selected sequence of the reporter includes a second unique flanking sequence and m repeat units, where 
m « 0, 1 , 2..., or fractions thereof, but where the sum of repeat units in the capture probe plus reporter probe is greater than 0 (n+m>0). 
In accordance with this method, the sequence of the capture probe differs at least two test sites. Hie hybridization complex assays are 
then monitored to determine concordance and discordance among the hybridization complex assays at the test sites as determined at least 
in part by hybridization stability. Ultimately, the nature of the repeat units in the target sequence may be determined based upon the 
concordant/discordant determination coupled with knowledge of the probes located in the hybridization complex at that site. In one aspect 
of this invention, electronic stringency control may be utilized either during the initial hybridization phase, such as to aid in the proper 
indexing of hybridized materials, or during the concordance/discordance phase, or l)0th. In yet another aspect of this invention, a system 
includes a plurality of sites wherein each site includes at least a probe having a first unique flanking sequence, a second unique flanking 
sequence and an intervening repeat unit series having a variable number of repeat units. The existence of loop-outs, or conditions of exact 
concordance, may be detennined. Applications include paternity testing, forensic use, and disease diagnostics, such as for the identification 
of the existence of a clonal tumor. 
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DESCRIPTION 



METHODS AND APPARATUS FOR DETERMINATION OF 
LENGTH POLYMORPHISMS IN DNA 



Field of the Invention 

The methods and apparatus of these inventions relate to systems for genetic 
identification for disease state identification. More particularly, the methods and apparatus 
10 relate to systems for the detection of repeat unit states, such as the number of short tandem 
repeat imits for the identification of individuals such as in a forensic or paternity sense, or for 
determination of disease states, such as for clonal tumor detection. 



Related Application Information 

15 This application is related to AppUcation Serial No. 08/986,065, filed December 5, 

1997, entitled "METHODS AND PARAMETERS FOR ELECTRONIC BIOLOGICAL 
DEVICES", which is a continuation-in-part of application Serial No. 08/534,454, filed 
September 27, 1995, entitied "APPARATUS AND METHODS FOR ACTIVE 
PROGRAMMABLE MATRIX DEVICES", which is a continuation-in-part of application 

20 Serial No. 08/304,657, filed September 9, 1994, entitled "AUTOMATED MOLECULAR 
BIOLOGICAL DL^GNOSTIC SYSTEM," now issued as United States Patent No. 5,632,957, 
(which has been continued into application Serial No. 08/859,644, filed May 20, 1997, 
entitled "CONTROL SYSTEM FOR ACTIVE, PROGRAMMABLE ELECTRONIC 
MICROBIOLOGY SYSTEM"), which is a continuation-in-part of application Serial No. 

25 08/271,882, filed July 7, 1994, entitled "METHODS FOR ELECTRONIC STRINGENCY 
CONTROL FOR MOLECULAR BIOLOGICAL ANALYSIS AND DIAGNOSTICS," now 
allowed, which is a continuation-in-part of Serial No. 08/146,504, filed November 1, 1993, 
entitled "ACTIVE PROGRAMMABLE ELECTRONIC DEVICES FOR MOLECULAR 
BIOLOGICAL ANALYSIS AND DIAGNOSTICS", now issued as U.S. Patent No. 

30 5,605,662, (which has been continued into application Serial No. 08/725,976, filed October 
4, 1996, entitled "METHODS FOR ELECTRONIC SYNTHESIS OF POLYMERS"), and 
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also a continuation-in-part of application Serial No. 08/708,262, filed September 6, 1996, 
entitled "METHODS AND MATERIALS FOR OPTIMIZATION OF ELECTRONIC 
HYBRIDIZATION REACTIONS", all incorporated herein by reference as if fiilly set forth 
herein. 

5 

Background of the Invention 

Molecular biology comprises a wide variety of techniques for the analysis of nucleic 
acid and protein. Many of these techniques and procedures form the basis of clinical 
diagnostic assays and tests. These techniques include nucleic acid hybridization analysis, 

1 0 restriction en2yme analysis, genetic sequence analysis, and the separation and purification of 
nucleic acids and proteins (See, e.g., J. Sambrook, E, F. Fritsch, and T. Maniatis, Molecular 
Cloning: A Laboratory Manual. 2 Ed., Cold spring Harbor Laboratory Press, Cold Spring 
Harbor, New York, 1989). 

Most of these techniques involve carrying out numerous operations (e.g., pipetting, 

15 centrifiigation, electrophoresis)on a large number of samples. They are often complex and 
time consuming, and generally require a high degree of accuracy. Many a technique is limited 
in its application by a lack of sensitivity, specificity, or reproducibility. For example, these 
problems have limited many diagnostic applications of nucleic acid hybridization analysis. 
The complete process for carrying out a DNA hybridization analysis for a genetic or 

20 infectious disease is very involved. Broadly speaking, the complete process may be divided 
into a number of steps and substeps. In the case of genetic disease diagnosis, the first step 
involves obtaining the sample (blood or tissue). Depending on the type of sample, various pre- 
treatments would be carried out The second step involves disrupting or lysing the cells, which 
then release the crude DNA material along with other cellular constituents. Generally, several 

25 sub-steps are necessary to remove cell debris and to purify fiirther the crude DNA. At this 
point several options exist for fiirther processing and analysis. One option involves 
denaturing the purified sample DNA and carrying out a direct hybridization analysis in one 
of many formats (dot blot, microbead, microplate, etc.). A second option, called Southern 
blot hybridization, in volves cleaving the DNA with restriction enzymes, separating the DNA 

30 Augments on an electrophoretic gel, blotting to a membrane filter, and then hybridizing the 
blot with specific DNA probe sequences. This procedure effectively reduces the complexity 
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of the genomic DNA sample, and thereby helps to improve the hybridization specificity and 
sensitivity. Unfortunately, this procedure is long and arduous. A third option is to carry out 
an amplification procedure such as polymerase chain reaction (PGR), strand displacement 
amplification or other method. These procedures amplify (increase) the nimiber of target 
5 DNA sequences relative to non-target sequences. Amplification of target DNA helps to 
overcome problems related to complexity and sensitivity in genomic DNA analysis. After 
these sample preparation and DNA processing steps, the actual hybridization reaction is 
performed. Finally, detection and data analysis convert the hybridization event into an 
analytical result. 

10 Nucleic acid hybridization analysis generally involves the detection of a very small 

number of specific target nucleic acids (DNA or RNA) with an excess of probe DNA, among 
a relatively large amount of complex non-target nucleic acids. The substeps of DNA 
complexity reduction in sample preparation have been utilized to help detect low copy 
numbers (i.e. 10,000 to 100,000) of nucleic acid targets. DNA complexity is overcome to 

15 some degree by amplification of target nucleic acid sequences using polymerase chain 
reaction (PGR) and other methods. (See, M.A. Innis et al, PGR Protocols: A Guide to 
Methods and Applications. Academic Press, 1990, Spargo et al., 1996, Molecular & Gellular 
Probes, in regard to SDA amplification). Amplification results in an enormous number of 
target nucleic acid sequences that improves the subsequent direct probe hybridization step. 

20 The actual hybridization reaction represents one of the most important and central 

steps in the whole process. The hybridization step involves placing the prepared DNA sample 
in contact with a specific reporter probe, at a set of optimal conditions for hybridization to 
occur to the target DNA sequence. Hybridization may be performed in any one of a number 
of formats. For example, multiple sample nucleic acid hybrid-ization analysis has been 

25 conducted on a variety of filter and solid support formats (See G. A. Beltz et al., in Methods 
in Enzvmoloigy, Vol 100, Part B, R. Wu, L. Grossman, K. Moldave, Eds., Academic Press, 
New York, Chapter 19, pp. 266-308, 1985). One format, the so-called "dot blot" 
hybridization, involves the non-covalent attachment of target DNAs to filter, which are 
subsequently hybridized with a radioisotope labeled probe(s), "Dot blot" hybridization gained 

30 wide-spread use, and many versions were developed (see M. L. M. Anderson and B. D. 
Young, in Nucleic Acid Hybridization - A Practical Approach, B. D. Hames and S. J. Higgins, 
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Eds., IRL Press, Washington, D.C. Chapter 4, pp. 73-1 11, 1985). It has been developed for 
multiple analysis of genomic mutations (D. Nanibhushan and D. Rabin, in EPA 0228075, July 
8, 1987) and for the detection of overlapping clones and the construction of genomic maps 
(G. A, Evans, in US Patent Number 5,219,726, June 15, 1993). 
5 New techniques are being developed for carrying out multiple sample nucleic acid 

hybridization analysis on micro-formatted multiplex or matrix devices (e.g., DNA chips) (see 
M. Barinaga, 253 Science, pp. 1489, 1991; W. Bains, 10 Bio/Technology, pp. 757-758, 1992). 
These methods usually attach specific DNA sequences to very small specific areas of a solid 
support, such as micro-wells of a DNA chip. These hybridization formats are micro-scale 

10 versions of the conventional "dot blot" and "sandwich" hybridization systems. 

The micro-fonnatted hybridization can be used to carry out "sequencing by 
hybridization" (SBH) (see M, Barinaga, 253 Science, pp. 1489, 1991; W. Bains, 10 
Bio/Technology, pp. 757-758, 1992). SBH makes use of all possible n-nucleotide oHgomers 
(n-mers) to identify n-mers in an unknown DNA sample, which are subsequently aligned by 

1 5 algorithm analysis to produce the DNA sequence (R. Drmanac and R. Crkvenjakov, Yugoslav 
Patent Application #570/87, 1987; R. Drmanac et al., 4 Genomics, 1 14, 1989; Strezoska et 
al,, 88 Proc. Natl. Acad. Sci. USA 10089, 1992; and R. Dmianac and R. B. Crkvenjakov, U.S. 
Patent #5,202,231, April 13, 1993). 

There are two formats for carrying out SBH. The first format involves creating an 

20 array of all possible n-mers on a support, which is then hybridized with the target sequence. 
The second format involves attaching the target sequence to a support, which is sequentially 
probed with all possible n-mers. Both formats have the fundamental problems of direct probe 
hybridizations and additional difficulties related to multiplex hybridizations. 



25 Southern, United Kingdom Patent Application GB 8810400, 1988; E. M. Southern et 

al., 13 Genomics 1008, 1992, proposed using the first format to analyze or sequence DNA. 
Southem identified a known single point mutation using PGR amplified genomic DNA. 
Southern also described a method for synthesizing an array of oligonucleotides on a solid 
support for SBH. However, Southem did not address how to achieve optimal stringency 

30 condition for each oligonucleotide on an array. 
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Concurrently, Drmanac et al., 260 Science 1649-1652, 1993, used the second format 
to sequence several short (1 16 bp) DNA sequences. Target DNAs were attached to membrane 
supports ("dot blot" format). Each filter was sequentially hybridized with 272 labeled lO-mer 
and 11 -mer oligonucleotides. A wide range of stringency condition was used to achieve 
5 specific hybridization for each n-mer probe; washing times varied from 5 minutes to 
overnight, and temperatures from 0%C to 16%C. Most probes required 3 hours of washing 
at 16%C. The filters had to be exposed for 2 to 18 hours in order to detect hybridization 
signals. The overall false positive hybridization rate was 5% in spite of the simple target 
sequences, the reduced set of oligomer probes, and the use of the most stringent conditions 
10 available. 

A variety of methods exist for detection and analysis of the hybridization events. 
Depending on the reporter group (fluorophore, enzyme, radioisotope, etc.) used to label the 
DNA probe, detection and analysis are carried out fluorimetrically, colorimetrically, or by 
autoradiography. By observing and measuring emitted radiation, such as fluorescent radiation 

1 5 or particle emission, information may be obtained about the hybridization events. Even when 
detection methods have very high intrinsic sensitivity, detection of hybridization events is 
difficult because of the background presence of non-specifically bound materials, A number 
of other factors also reduce the sensitivity and selectivity of DNA hybridization assays. 

One form of genetic analysis consists of determining the nature of relatively short 

20 repeating sequences within a gene sequence. Short tandem repeats (STR's) have been 
identified as a useful tool in both forensics and in other areas (paternity testmg, tumor 
detection, D. Sidransky, genetic disease, animal breeding). Indeed, the United States Federal 
Bureau of Investigation has announced that it is considering the use of short tandem repeat 
sequences for forensic purposes. (Dr. Bruce Budowle, DNA Forensics, Science, Evidence and 

25 Future Prospects, McLean, VA. Nov. 1 997). 

Various proposals have been made for identifying, amplifying, detecting and using 
polymorphic repeat sequences. For example, Tautz PCT WO90/04040-PCT/EP98/01203, in 
an application entitled "Process for the Analysis of Length Polymorphorisms in DNA 
Regions" (translated fi*om German), discloses a process for the analysis of length 

30 polymorphisms in regions of simple or cryptically simple DNA sequences. Tautz discloses 
a method which includes these steps of addition of at least one primer pair onto the DNA that 
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is to be analyzed, wherein one of the molecules of the primer pair is substantially 
complementary to. the complementary strands of the 5' respectively 3* flank of a simple or 
cryptically simple DNA sequence and wherein the addition takes place within orientation that 
is such that the synthesis products obtained from a primer controlled polymerization reaction 
5 with one of the two primers can be used, following denaturation, as matrices for the addition 
of the other primer, performing a primer-controlled polymerization reaction and separating, 
such as by normal gel electrophoresis the products and analyzing the polymerase chain 
reaction products. 

Caskey et al. at the Baylor College of Medicine also detected polymorphisms in a 

10 short tandem repeat by performing DNA profiling assays. In Caskey et al., U.S. Patent No. 
5,364,759, issued November 15, 1994, entitled "DNA Typing With Short Tandem Repeat 
Polymorphisms and Identification of Polymorphic Short Tandem Repeats" discloses a method 
including steps of extracting DNA from a sample to be tested, amplifying the extracted DNA 
and identifying the amplified extension products for each different sequence. Caskey required 

1 5 that each different sequence be differentially labeled. A physical separation was performed 
utilizing electrophoresis. 

C.R. Cantor and others more recently disclosed a technique for scoring short tandem 
DNA repeats. The method is disclosed in Yarr, R. et al., "In Situ Detection of Tandem DNA 
Repeat Length", Genetic Analysis: Biomolecular Engineering, 13(1996) 1 13-118, and PCT 

20 AppUcation W096/36731, PCT/US96/06527 entitled "Nucleic Acid Detection Methods". 
These disclose hybridization of an oligonucleotide target containing tandem repeats 
embedded in a unique sequence with a set of complementary probes containing tandem 
repeats of known lengths. Smgle-stranded loop structures result in duplexes containing a 
mismatched (defined there to be a different) number of tandem repeats. When a matched 

25 (defined there to be identical) number of tandem repeats existed on the duplex, no loop 
structure formed. The loop structures were digested with a single-stranded nuclease. 
Differential wavelength, such as through differentially colored fluoriflors of the various length 
probes identified where matched sites existed. No express use of electrophoretic separation 
was required in accordance with this method. 
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Despite the knowledge of the existence of polymorphism in repeat xmits now for 
approximately 15 years, as well as their known desirability for application in forensics and 
genetic testing, commercially acceptable implementations have yet to be achieved. 
Simmiarv of the Invention 
5 Methods and apparatus are provided for the analysis and determination of the nature 

of repeat xmits in a genetic target. In one method of this invention, the nature of the repeat 
units in the genetic target is determined by the steps of providing a plurality of hybridization 
complex assays arrayed on a plurality of test sites, where the hybridization complex assay 
includes at least a nucleic acid target containing a simple repetitive DNA sequence, a capture 

10 probe having a first unique flanking sequence and n repeat units, where n = 0, 1, 2 being 
complementary to the target sequence, and a reporter probe having a selected sequence 
complementary to the same target sequence strand wherein the selected sequence of the 
reporter includes a second unique flanking sequence and m repeat imits, where m= 0, 1 , 2..., 
but where the sum of repeat units in the capture probe plus reporter probe is greater than 0 

1 5 (n+m>0). In accordance with this method, the sequence of the capture probe differs at least 
two test sites. The hybridization complex assays are then monitored to determine 
concordance and discordance among the hybridization complex assays at the test sites as 
determined at least in part by hybridization stability. Ultimately, the nature of the repeat units 
in the target sequence may be determined based upon the concordant/discordant determination 

20 coupled with knowledge of the probes located in the hybridization complex at that site. 

By way of example, in implementation of this method, assimie that a target contains 
six repeat units. In a system simplified merely for expository convenience, the plurality of 
hybridization complex assays might be three assays arrayed on an APEX type bioelectronic 
system, wherein a first assay includes a capture probe having four repeat units (n=4), the 

25 second assay has a capture probe with five repeat xmits (n=5) and the third assay has captxire 
probes with six repeat xmits (n=6). If the reporter probe is selected to have one repeat xmit 
(m=l), the total number of repeat units at the first assay will be five (n+m=4+l=5), the total 
nxmiber of repeat xmits at the second assay will eqxial six (n+m=5-l=6), and the total mmiber 
of repeat xmits at the third assay will eqxial seven (n+m=6+l=7). The second test site will be 

30 the concordant test site since the nximber of repeat xmits in the target in this case eqxjals the 
nxxmber of repeat xmits in the capture plus the reporter probes, that is it is the test site with six 
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repeat xinits both in the target and in the combination of the capture probe and the reporter 
probe. Utilizing the knowledge regarding probe placement, the second test site is known to 
include a capture probe having five repeat units (n=5), such that when coupled with the 
knowledge of the reporter probe including one repeat unit, the total number of six repeat units 
5 in the target is determined. 

In the preferred embodiment of these inventions, electronically aided hybridization or 
concordance and discordance determination, or both, are utilized in the process. In one 
aspect, during the hybridization of the nucleic acid target with the capture probe and/or the 
reporter probe, electronic stringent conditions may be utilized, preferably along with other 

10 stringency affecting conditions, to aid in the hybridization. This technique is particularly 
advantageous to reduce or eliminate slippage hybridization among repeat imits, and to 
promote more effective hybridization. In yet another aspect, electronic stringency conditions 
may be varied during the hybridization complex stability determination so as to more 
accurately or quickly determine the state of concordance or discordance. 

1 5 In yet another aspect of this invention, a method is provided for the determination of 

the nature of the repeat units in a genetic target by providing a bioelectronic device including 
a set of probes arrayed at a set of test sites, the probes having a first unique flanking sequence, 
a second unique flanking sequence, and an intervening repeat xmit series having variable 
numbers of repeat units. The target is hybridized with the set of probes at the set of test sites, 

20 \mder electronic stringency hybridization conditions, and the concordance/discordance at the 
test sites is then determined. In the preferred embodiment, the concordance/discordance is 
determined at least in part through the use of electronic hybridization stability determinations. 
The concordant test site indicates which probe includes the nximber of repeat units identical 
to that in the target. In a variation of this embodiment, electronic stringency control is utilized 

25 only during the concordance/discordance determination. 

In yet another aspect of this invention, methods and apparatus are provided for the 
determination of target alleles which vary in size in a sample. A platform is provided for the 
identification of target alleles which includes probes selected fix)m the group consisting of (i) 
a probe having a first unique flanking sequence, an intervening repeat region and a second 

30 unique flanking sequence, and (ii) a sandwich assay comprising a capture probe having a first 
unique flanking sequence and 0,1,2... repeat units and a reporter probe having 0,1,2... repeat 
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units in sequence with a second unique flanking sequence. Thereafter, the target is hybridized 
with the probes, preferably under electronic stringent conditions so as to aid in proper 
indexing, or alternatively, utilizing electronic stringency conditions during subsequent steps, 
or using electronic stringency both during hybridization and at later steps, thereafter 
5 determining concordance and discordance at the test sites as determined at least in part by 
hybridization stability. 

In one aspect of the inventions, the location of the concordant test site represents the 
nature of the target sequence repeat units by the number of repeat units present in the target, 
and that in turn is based upon the knowledge of the probes located at that test site. Namely, 

1 0 the particular probes associated vwth a given physical test site typically will be known in terms 
of their sequence, especially including the number of repeat units, and the physical position 
of those test sites results in a knowledge for the concordant sites of the nature of the target, 
especially the number of repeat units. Typically, at a concordant test site, the number of 
repeat imits in the target eqxials the sum of the number of repeat units in the capture probe and 

1 5 the number of repeat units in the reporter probe. 

One advantageous aspect of the inventions is that the methods and apparatus are 
effective in detennining the presence of microvariants in the target sequence. Such 
microvariants may include one or more deletions, insertions, transitions and/or transversions. 
These may be for a single base or for more than a single base. Deletions or insertions within 

20 repeat imits can be detected by gel separation methods when using highly controlled 
conditions. This reqxiires single base resolution and is near the limit of detection for most gel 
separation techniques. For transitional or transversional mutations, the size of the allele 
doesn't change, even though the sequence has become altered. Conventional gel sieving 
methods have a very difficult time detecting these types of mutations, and recent findings by 

25 other investigators (Sean Walsh, Dennis Roeder, DNA Forensics: Science, Evidence and 
Future Prospects, McLean, VA. Nov. 1997) suggest that transitional and transversional 
mutations can cause subtle anomalies resulting in difficult gel analysis sometimes resulting 
in obfiiscation of STR analysis. Our method is an hybridization technique and is quite adept 
at reliably detecting single nucleotide polymorphisms as described above. Additionally, by 

30 designing specific capture and reporter oUgonucleotides these assays can be done on the same 
platform used to discriminate the nature of STR alleles by repeat unit niraiber. The general 
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strategy of designing capture oligos for microvariant analysis is the same as it is for integral 
repeat units, however reporter oligos may differ in that they may or may not contain unique 
flanking sequence. The condition of effectively determining concordance by maximizing the 
hybridization complex stability remains since oligo design parameters which yield base 
5 stacking (as described above) are still followed.. 

In yet another aspect of these inventions, various additional steps may be utilized in 
order to promote distinguishing concordant and discordant test sites. One mode of 
concordance may be that in which there is a complementary match of bases in the 
hybridization complex including the capture, reporter and target in the sandwich assay format. 

10 In yet another highly advantageous arrangement, the use of juxtaposed terminal nucleotides 
of the reporter and capture may be utilized, wherein their contiguous nature permits 
interaction, such as base stacking. Advantageously, the juxtaposed terminal nucleotide 
identities may be selected, as allowed by the existing repeat unit or otherwise relevant 
sequence, so as to increase the energy difference between concordance and discordance. It 

15 has been reported that base stacking between different bases varies in stability through an 
approximately 4-fold range (Saenger, Principles of Nucleic Acid Structure, 1984, Springer- 
Verlag, NewYork, NY). Experimental results have shown at least a ten-fold, and often times 
at least more than twenty-fold, improvement in discrimination ratios for the pairings 5'G-A3' 
versus 5T-A3*, when analyzed in our system. While this result is generally in concert with 

20 the published findings that 5'G-A3' base stacking provides greater stability than 5'T-A3' 
pairs, the differential stability increase seen with our assay greatly exceeds the reported 
values. It is highly beneficial that this invention exploits this natural condition to provide a 
superior assay advantage. In yet other embodiments, the terminal nucleotides may be 
modified to increase base stacking effects, such as with the addition of propynyl groups, 

25 methyl groups or cholesterol groups. In yet another related aspect, ligation techniques may 
be utilized, such as enzyme ligation or chemical ligation, so as to increase the energy 
difference between a concordant and discordant site. 

Discordance may be manifested in various ways, such as in the sandwich assay format 
wherein a gap or overlap exists, or in the loop out method where a loop out exists. Further, 

30 discordance may exist in the repeat region where there is a base variation, such as a deletion, 
insertion, transition and/or transversion. 
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In distinguishing concordant and discordant test sites, the distinction is preferably 
drawn in part based on hybridization stability. Hybridization stability may be influenced by 
numerous factors, including thermoregulation, chemical regulation, as well as electronic 
stringency control, either alone or in combination with the other listed factors. Through the 
5 use of electronic stringency conditions, in either or both of the target hybridization step or the 
reporter oligonucleotide stringency step, rapid completion of the process may be achieved. 
Electronic stringency hybridization of the target is one distinctive aspect of this method since 
it is amenable with double stranded DNA and results in rapid and precise hybridization of the 
target to the capture. This is desirable to achieve properly indexed hybridization of the target 
10 DNA to attain the maximum number of molecules at a test site with an accurate hybridization 
complex. By way of example, with the use of electronic stringency, the initial hybridization 
step may be completed in ten minutes or less, more preferably five minutes or less, and most 
preferably one minute or less. Overall, the analytical process may be completed in less than 
half an hour. 

15 

As to detection of the hybridization complex, it is preferred that the complex is 
labeled. Typically, in the step of determining concordance and discordance, there is a 
detection of the amount of labeled hybridization complex at the test site or a portion thereof 
Any mode or modality of detection consistent wdth the purpose and functionality of the 

20 invention may be utilized, such as optical imaging, electronic imaging, use of charge coupled 
devices or other methods of quantification. Labeling may be of the target, capture or reporter. 
Various labeling may be by fluorescent labeling, colormetric labeling or chemiluminescent 
labeling. In yet another implementation, detection may be via energy transfer between 
molecules in the hybridization complex. In yet another aspect, the detection may be via 

25 fluorescence perturbation analysis. In another aspect the detection may be via conductivity 
differences between concordant and discordant sites. 

In yet another aspect of these inventions, a redundant assay may be conveniently 
performed. In one implementation, a serial redundant assay may be utilized, such as where 
after an initial hybridization complex assay is perforaied, the stringency conditions are 

30 increased so as to effect denaturation, thereby removing the reporter fi:om the first 
hybridization complex assay. A second reporter may then be hybridized to the remaining 
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complex target and capture probe, wherein the second reporter includes a number of repeat 
units which differs from the number or type of repeat units in the first reporter. In this way, 
through the practice of the other steps as described for other applications, the physical test site 
at which concordance exists will have moved. The result is that a redimdant assay has been 
5 performed on the same device and sample material. 

Yet another redundant assay may be performed wherein multiple, e.g., two or more, 
independent sets of assays exist. A first reporter is hybridized to a first set of assays, and a 
second reporter is hybridized to a second set of assays, wherein the number of repeat units in 
the first reporter differs fi-om the number or nature of repeat xmits in the second reporter. 
10 Determination of concordance/discordance at the test site of the arrays, when coupled with 
the knowledge of the probes located as those test sites, provides two complexes firom the 
hybridization assays for confirmation of the target repeat nimiber or nature. 

The systems and methods of these inventions are particularly useful for determining 
the nature of complex samples, such as heterozygous samples, and mixed samples such as 
15 those from multiple sources or donors. In application, the methods and systems of these 
inventions may be utilized for a broad array of applications. Among them include 
identification, such as for paternity testing or for other forensic use. Yet another application 
is in disease diagnostics, such as for the identification of the existence of a clonal tumor, 
where the tumor includes repeat units of a nature or number different than the patient's 
20 imdiseased genetic state. 

Accordingly, it is an object of this invention to provide methods and systems for the 
rapid identification of the nature and/or number of repeat units in a polymorphic system. 

It is yet a further object of this invention to provide methods and apparatus which may 
effectively provide for genetic identification. 
25 It is yet a further object of this invention to provide systems and methods for the 

accurate detection of diseased states, especially clonal tumor disease states, neurological 
disorders and predisposition to genetic disease. 

It is yet a further object of this invention to provide a rapid and effective system and 
methods for identification, such as in forensics and paternity applications. 

30 

Brief Description of the Drawings 
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FIG. 1 A is a cross sectional view of one embodiment of an active matrix device useful 
in accordance with the methods of this invention. 

FIG. IB is a perspective view of an active array device useftil with the methods of this 
invention. 

5 FIG. 2 is a symbolic drav^ng of the components of a multiplex assay, including a 

target, a reporter and a capture sequence. 

FIGS. 3A, 3B and 3C are diagrammatic sketches of the multiplex assay including a 
target, reporter and capture sequence in which there is existing a gap, overlap and match, 
respectively. 

10 FIGS. 4 A, 4B and 4C show a detailed sequence listing for a multiplex assay of the 

THOl locus, evidencing a gap, an overlap and a match, respectively, corresponding to the 
diagranunatic representations of FIGS. 3 A, 3B and 3C. 

FIGS. 5A - 5G depict a diagrammatic view of a sequence of mxiltiplex assays showing 
a target with eight repeat units, a reporter with a single repeat unit and capture sequences 
15 including from four to ten repeat units, in FIGS. 5 A - 5G, respectively. 

FIG. 6 shows a plan view of an array of test sites for a sandvsdch assay, with the 
concordant test site depicted as shaded to represent the presence of hybridization complex. 

FIGS. 7A - 7G show a diagrammatic view of a mxiltiplex discrimination system, 
20 including a target having eight base repeat units, a reporter having two base repeat units, and 
a series of capture sequences including from four to ten repeat units in FIGS. 7 A - 7G, 
respectively. 

FIG. 8 shows a plan view of an array of test sites for a redundant assay with the 
concordant test depicted as shaded to represent the presence of hybridization complex. 
25 FIG. 9A shows a diagranunatic view of a target and hybridized reporter in a 

concordant condition. 

FIG. 9B shows a target and reporter in a discordant condition, namely in a loop 
condition. 

FIGS. lOA - lOG depict multiplex discrimination in a loop out system wherein a target 
30 includes seven repeat units and the reporters include from five to eleven repeat units in FIGS. 
lOA - lOG, respectively. 
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FIG. 1 1 is a graph of fluorescence (MFI) as a function of capture oligo repeat unit 
number in the identification of THOl target DNA alleles by the sandwich hybridization 
method. 

FIG. 12 is a graph of normalized fluorescence as a function of the number of repeat 
5 units in the capture sequence, where the reporter includes one repeat unit (left-hand side) and 
includes zero repeat units (right-hand side) showing a redundant reporter system for the THOl 
locus. 

FIG. 13 is a chart showing the specific olignucleotides utilized for capture sequences, 
reporter sequences and the target alleles in the THOl locus. 
10 FIG. 14 is a graph of discrimination ratios as a fimction of G-A stacking compared to 

T-A stacking in the paired left, right or graphs, respectively, for an eight x versus seven x 
discrimination (left-hand bars) and for a fifteen x versus fourteen x discrimination (right- 
hands bars) showing discrimination of targets by G-A and T-A stacking. 

FIG. 15 is a graph of discrimination ratio showing four couplets for chip one through 
1 5 chip four, respectively, showing maximum discrimination ratios utilizing a ten mer reporter 
(a left bar in couplet) and ten mer reporter with terminal propynyl group (right bar in couplet). 

FIG. 16 is a graph of normalized fluorescence intensity as a function of capture oligo, 
in a heteroxygeous TPOX locus. 

FIG. 17 is a table of the nucleotide sequences for the TPOX capture oligonucleotides, 
20 reporter oligonucleotides and target allele for the TPOX locus. 

FIG. 18 is a graph of fluorescence intensity (MFI/sec) as a function of number of 
capture repeat xmits for hybridization discrimination of CSFIPON alleles. 

FIG. 19 is a table of the capture oligonucleotides, reporter oligonucleotides and target 
alleles in the CSFIPO alleles. 
25 FIG. 20 is a graph of normalized intensity as a function of repeat unit number in the 

capture sequence in a THOl /TPOX multiplex analysis. 

FIG. 21 is a graph of relative fluorescence as a function of repeat imit number in the 
capture oligonucleotide in a system for the identification of repeat unit alleles in double 
stranded polymers chain reaction (PGR) amplified DNA including the target THOl locus. 
30 FIGS. 22A, 22B and 22C are graphs of the fluorescence (MFI) as a fimction of capture 

oligonucleotide having a gap (leftmost bar in triplet), match (center bar in triplet) or overlap 
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(right bar in triplet) for the initial signal, for the signal after three minutes of denaturation and 

for the signal after ten minutes of denaturation for FIGS. 22 A through 22C, respectively. 
FIG. 23 is a table showing the discrimination of match from mismatch/gap and 

mismatch/overlap for various combinations. 
5 FIG. 24 is a graph of percentage fluorescence as a fiinction of number of repeat units 

in the capture sequence for a determination of the nature and number of repeat units in allelic 

identity of target DNA utilizing the loop out analysis. 

FIGS. 25 A, 25B and 25C show a detailed sequence listing for a multiplex THOl 

microvariant assay, evidencing a gap, an overlap and a match, respectively. 
10 FIG. 26 is a graph of fluorescence intensity (MFI) as a fimction of capture 

oligonucleotide number or nature for the detection of the microvariant allele THOl 9.3. 

Detailed Description of the Invention 

Figs. 1 A and IB illustrate a simplified version of the active programmable electronic 

matrix hybridization system for use with this invention. Generally, a substrate 10 supports 
15 a matrix or array of electronically addressable microlocations 12. For ease of explanation, the 

various microlocations in Fig. 1 A have been labeled 12 A, 12B, 12C and 12D. A pemieation 

layer 14 is disposed above the individual electrodes 12. The permeation layer permits 

transport of relatively small charged entities through it, but limits the mobility of large 

charged entities, such as DNA, to keep the large charged entities from easily contacting the 
20 electrodes 12 directly during the duration of the test. The permeation layer 14 reduces the 

electrochemical degradation which would occur in the DNA by direct contact with the 

electrodes 12, possibility due, in part, to extreme pH resulting fix)m the electrolytic reaction. 

It fiirther serves to minimize the strong, non-specific adsorption of DNA to electrodes. 

Attachment regions 16 are disposed upon the permeation layer 14 and provide for specific 
25 binding sites for target materials. The attachment regions 1 6 have been labeled 1 6A, 1 6B, 1 6C 

and 16D to correspond with the identification of the electrodes 12A-D, respectively. 

In operation, reservoir 18 comprises that space above the attachment regions 16 that 

contains the desired, as well as undesired, materials for detection, analysis or use. Charged 

entities 20, such as charged DNA are located within the reservoir 18, In one aspect of this 
30 invention, the active, programmable, matrix system comprises a method for transporting the 

charged material 20 to any of the specific microlocations 12. When activated, a microlocation 
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12 generates the free field electrophoretic transport of any charged functionalized specific 
binding entity 20 towards the electrode 12. For example, if the electrode 12A were made 
positive and the electrode 12D negative, electrophoretic lines of force 22 would run between 
the electrodes 12A and 12D, The lines of electrophoretic force 22 cause transport of charged 
5 binding entities 20 that have a net negative charge toward the positive electrode 12A. 
Charged materials 20 having a net positive charge move under the electrophoretic force 
toward the negatively charged electrode 12D. When the net negatively charged binding entity 
20 that has been fimctionalized contacts the attachment layer 16A as a result of its movement 
under the electrophoretic force, the fimctionalized specific binding entity 20 becomes 

10 covalently attached to the attachment layer 16 A. 

Before turning to a detailed discussion of the inventions of this patent, the general 
matter of terminology will be discussed. The term "short tandem repeat" (STR) as used 
herein refers to a locus containing simple sequence motifs which are tandemly repeated the 
variable number of times at different alleles of that locus. A repeat unit or repeat xinits 

15 typically refers to individual simple sequence motifs which are repeated in a short tandem 
repeat. Repeat units may be, by way of examples, complete repeat units which contain 
identically repeating simple sequence motifs, or may be partial repeat units, such as where 
there is some difference between repeat units, such as in the existence of microvariants 
between repeat units. A concordant test site is taken to be a test site exhibiting a relative or 

20 local maxima of hybridization complex stability. By way of example, a concordant test site 
may be one wherein the number of repeat units in the target is equal to the number of repeat 
units in a capture plus the number of repeat xmits in a reporter probe for a multiple system, or 
wherein the number of repeat units in the target equals the number of repeat units in a probe. 
In yet another example of a concordant test site, if partial repeat units are present, a 

25 concordant test site may be manifested by a site where the repeat units in the target are 
substantially similar to the nature of the repeat units in the capture plus probe, or single probe, 
as appropriate. A discordant site, on the other hand, is a site exhibiting a relatively lower 
level of hybridization complex stability relative to at least one other site. Examples of test 
sites which typically would be termed discordant would be those where there exists a gap, 

30 overlap, point mutation (e.g., smgle base variation such as deletion, insertion, transition and 
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transversion), point mutations plus overlap, point mutations plus gap, single nucleotide 
variants or other microvariants. 

A hybridization complex assay in a multiplex system, such as in a sandwich assay, 
typically will include a target, a capture and a reporter. A hybridization complex assay in a 
5 loopout application includes at least typically a target and a probe. An array as used herein 
typically refers to multiple test sites, minimally two or more test sites. The typical number of 
test sites will be one for each allele of the locus. The number of loci required for a test will 
vary depending on the application, with generally one for genetic disease analysis, one to five 
for tumor detection six, eight, nine thirteen or more for paternity testing and forensics. The 

10 physical positioning of the test sites relative to one another may be in any convenient 
configuration, where linear or in an arrangement of rows and columns. 

Fig. 2 shows a symbolic drawing of the components of a multiplex assay. A target 30 
inclxides a first unique flanking region 32, a second unique flanking region 34 and one or more 
repeat units 36 disposed between the first unique flanking region 32 and the second unique 

15 flanking region 34. The target 30 may be a single or double stranded nucleic acid from 
specific loci, such as THO 1 . A reporter 40 includes at least a unique sequence region 42, and 
optionally includes one or more repeat units 44 disposed at the terminal end of the unique 
sequence region 42. The reporter 40 may have no repeat units 44, or may include one or more 
repeat imits 44. If the reporter 40 is to be labeled, a label 46 is associated therewith. 

20 Typically, the unique sequence region 42 of the reporter 40 is complementary to the second 
unique flanking region 34 of the target 30. The capture 50 includes a capture unique sequence 
region 52 and one or more repeat units 54 which are adjoined to the terminal end of the 
capture imique sequence region 52. If the capture 50 is to be attached to a solid support or 
other anchoring medixmi, an attachment element 56, such as biotin may be utilized. 

25 Figs. 3A, 3B and 3C are diagrammatic sketches of the multiplex assay including a 

target, reporter and capture sequence in which there exists a gap, overlap and match, 
respectively. For simplicity, the numbering of Fig. 2 will be adopted here for corresponding 
structures. In Fig. 3A, a gap condition exists. Broadly, the target 30 is hybridized to the 
reporter 40 and to the capture 50. More particularly, the second unique flanking region 34 is 

30 hybridized to the complementary strand comprising the unique sequence region 42 of the 
reporter 40. Similarly, the first unique flanking region 32 of the target 30 is hybridized to the 
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complementary capture unique sequence region 52 of the capture 50, The target 30 includes 
eight repeat units 36 in this example. The structure as best described applies equally to Figs. 
3A, 3B and 3C. Fig. 3A depicts a gap region 56, which results from the capture 50 having 
six repeat imits 54 and the reporter 40 having one repeat imit 44. Thus, the total number of 
5 repeat units 44, 54 in the reporter plus capture is seven, which is one less than the total 
number of repeat units 36 in the target 30. In Fig. 3B, an overlap condition is shown. Here, 
the capture includes eight repeat units 34, and the reporter 40 still includes a single repeat unit 
44. Here, the total number of repeat units 34, 44 between the capture 50 and reporter 40 is 
nine, exceeding the mmiber of repeat xmits 36 in the target 30, whereby one repeat unit is 

10 overlapped, here shown to be the repeat unit 44 associated with the reporter 40. Fig. 3C 
shows a match between the target 30 and the reporter 40 plus capture 50. There are seven 
repeat units 34 associated with the capture 50 and one repeat imit 44 associated with the 
reporter 40. Accordingly, the number of repeat units 36 in the target 30 equals the sum of the 
repeat units 34 in the capture 50 plus the number of repeat units 44 in the reporter 40. 

15 Figs. 4A, 4B and 4C show one example of specific nucleotide sequences 

corresponding to the examples of Figs. 3 A, 3B and 3C. Note that the left to right orientation 
of Figs. 3A, 3B and 3C is reversed left to right for Figs. 4A, 4B and 4C. Fig. 4A shows a gap 
condition wherein the gap 56 is disposed between the repeat unit 44 of the reporter 40 and the 
terminal repeat unit (5*CATT3') adjacent the gap 56 of the capture 50. In Figs. 4A, 4B and 

20 4C, the base repeat unit 36 of the target 30 is 5'AATG3', and accordingly, its complement 
base sequence is 5'CATT3' 44, 54. hi Figs. 4A, 4B and 4C, the nucleotides of the base repeat 
imit are shown in capital letters. This is done to designate those units in distinction to the 
nucleotides forming the first unique flanking region 32 and second unique flanking region 
34 of the target 30 as well as the unique region sequence 42 of the reporter 40 and the capture 

25 unique sequence region 52 of the capture 50. As can be seen, the nucleotides in the 
hybridized strands are paired, namely A-T and G-C pairs are complementary. Fig. 4B shows 
a mismatch with an overlap of one repeat unit. Specifically, the base repeat unit 44 
comprising the series 5'CATT3' is displaced from a hybridized condition with the base unit 
36 adjacent the second unique flanking region 34 of the target 30. As shown, the 5' terminal 

30 nucleotide (designated "c") of the unique sequence region 42 of the reporter 40 is shown as 
being slightly displaced from complete hybridization with the complementary "g" terminal 
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nucleotide of the second unique flanking region 34 of the target 30. This depiction is 
optional, and may also include the condition in which the terminal nucleotide of the imique 
sequence region 42 is in a hybridized condition with the terminal nucleotide of the second 
unique flanking region 34 of the target 30. Fig. 4C shows a match condition, where in this 

5 example, the nature of the repeat region, that is both number of repeat units partial and whole, 
and the complementarity of the sequence match., In the matched condition of Fig. 4C, the 5' 
terminal nucleotide "C" of the repeat unit 44 of the reporter 40 is adjacent and contiguous 
with the 3' terminal nucleotide "T" of the base repeat unit 54 of the capture 50, permittmg 
base stacking between the 5' "C" of the repeat unit 44 of the reporter 40 and the 3' "T" of the 

10 base repeat unit 54 of the capture oligonucleotide 50. The two base stacked nucleotides are 
underlined in Fig. 4C. In a preferred embodiment, the hybridization complex may anchored 
via the capture oligo 50 which would contain the appropriate attachment chemistry, preferably 
biotm at its 5' terminus. Also in a preferred embodiment, the hybridization complex would 
be labeled via the reporter oligo 40 with an appropriate molecule, preferably a chromophore 

15 at it's 3' end. 

The nature of a repeat unit is defined here as comprised of both the number of whole 
(or integral) repeat units and partial repeat units. Partial repeat units also known as 
microvariants or cryptically simple sequence, may be comprised of single nucleotide 
divergences from the most common repeat unit sequence. These divergences may consist of 

20 insertions, deletions, transition or transversion polymorphisms of the simple repeat sequence. 
Since all prior methods for the analysis of STR loci have relied on size, or the number of 
nucleotides, information on the firequency of transition and transition repeat unit 
polymorphisms is scant. However other investigators have recently recognized their 
significance and it is likely that methods which can efficiently detect them will be valuable 

25 (Sean Walsh and Dennis Roeder, DNA Forensics, Science, Evidence and Future Prospects, 
McLean, VA., Nov. 1997). 

Figure 25 demonstrates how this invention detects the presence of a common 
microvariant THOl 9.3. Two new entities are presented here: a microvariant target sequence 
70 containing the partial repeat unit 71 ATG and a microvariant reporter oligonucleotide 80 

30 which is complementary to 71 and the seven 5' adjacent nucleotides. Fig. 25 shows the 
relationship of the DNA subsequences when the nature of the target allele is nine whole repeat 
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units and one partial repeat, resulting in a matched concordancy. Elements vMch have been 
discussed before in Fig. 4A-4C have retained their numerical appellation, and novel features 
have been labeled with new numbers,. Therefore the target sequence 70 is made up of a first 
unique sequence 34, integral repeat sequence 36, second unique flanking sequence 32 and 
5 presents partial repeat sequence 71. The capture sequence 50 is identical to that described 
in Fig. 4A, with the exception that it has only three repeat units 54. The microvariant reporter 
80 is similar to reporter 40 in that it has repeat sequence 44 but differs by a lack of unique 
flanking sequence and by the inclusion of sequence 81 which is complementary to the target 
70 partial repeat xmit 71. The reporter 80 stabiUty is enhanced by two features. First it is 

10 complementary only to the microvariant region and second, it will base stack and therefore 
attain concordancy or a local maxima of stability only at the site which contains the 3ru 
capture oligo. One practiced in the art will realize how to apply this invention to microvariant 
sequences which differ from the THOl 9.3 sequence. Figure 26 demonstrates the 
effectiveness of this method. 

15 Selection of the adjacent or proximal nucleotides so as to increase the energy 

difference between concordant and discordant test sites is advantageously employed. A 
detailed discussion of such selections or modifications, such as in the use of terminal 
nucleotide base stocking, or modifications of terminal nucleotides such as with propynyl 
groups, methyl groups or cholesterol groups, or through the use of ligation techniques such 

20 as enzymatic ligation or chemical ligation are discussed fiirther, below. 

Figs. 5A-5G depict a diagrammatic representation of a multiplex system. The 
hybridization complex used in this system is sometimes termed a sandwich assay. Again, for 
expository convenience, the numbering adopted corresponds to that utilized with Fig. 2, Figs. 
3 A, 3B and 3C and Figs. 4A, 4B and 4C. In each of the depictions, a target 30 has a imique 

25 flanking region 32 and a second unique flanking region 34, with an intervening set of repeat 
units 36. In the example, the number of repeat units 36 is 8. The reporter 40 includes a 
unique sequence region 42 which is complementary to the second unique flanking region 34, 
and includes in this example one repeat unit 44 at the terminal end of the reporter. The 
cqjture 50 includes a capture unique sequence region 52 and, in this example, multiple repeat 

30 units 54 at the terminal end of the capture 50. The capture unique sequence region 52 is 
complementary to the first unique flanking region 32. 
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Fig. 5A shows a capture 50 with 4 repeat units 54. Since the sum of the number of 
repeat units 54 in the capture 50 plus the number of repeat units 44 in the reporter 40 (4+1) 
is less than the number of repeat units 36 in the target 30 (eight) a gap 56 exists. The gap as 
shown in Fig. 5A is substantially of the length of 3 repeat units 36, 44, 54. As a matter of 
5 terminology, the number of repeat units 54 in the capture 50 is sometimes denominated an '"N 
capture", where N equals the number of base repeat units 54 in the capture 50 plus the number 
of base repeat units 44 in the reporter 40. With this terminology, a match exists with a N 
capture where N equals the number of repeat units 36 in the target 30. Thus, in the example 
of Fig. 5D, wherein a match condition exists, a capture 50 having 7 repeat xmits 54 may be 

1 0 also denominated an "8 capture" since the capture 50 having 7 repeat units 54 when used with 
these selected reporter 40 having a single repeat unit 44 provides a match in that the total 
number of repeat units 44, 54 equals the number of repeat units 36 in the target. It will be 
appreciated by those skilled in the art that various naming or number conventions may be 
utilized to accurately describe the underlying arrangements, and it is those \mderlying 

15 arrangements which comprise the inventions herein, and not the naming or numbering 
conventions adopted. 

Fig. 5B shows a capture 50 having 5 repeat units 54, wherein a gap 56 of the length 
of substantially 2 repeat units 36, 44, 54 exists. Fig. 5C shows a capture 50 having 6 repeat 
units 54, wherein a gap 56 of substantially the length of a single repeat unit 36, 44, 56 exists. 

20 Fig. 5D shows a match condition with a capture 50 having 7 repeat units 54 and a reporter 
40 having a single repeat unit 44 equals the number of repeat units 36 in the target 30. 

Figs. 5E, 5F and 5G show an overlap condition. Fig. 5E shows a capture 50 with 8 
repeat units 54. As depicted, the reporter 40 repeat unit 44 is shown as being in a substantially 
non-hybridized condition with the target 30. Fig. 5F shows a capture 50 with 9 repeat units 

25 54, wherein a terminal repeat unit 58 of the capture 50 and the repeat unit 44 of the reporter 
40 are both in a substantially non-hybridized condition with respect to the target 30. Fig. 5G 
shows a capture 50 having 10 repeat units 54, such that the two terminal repeat units 58 of the 
capture 50 and the repeat unit 44 of the reporter 40 are in a substantially non-hybridized 
relationship with the target 30. 

30 Fig. 6 shows a plan view of an array of test sites for use in a multiplex assay, such as 

a sandwich assay. The concordant test site is determined to be at the site containing the 7 
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repeat xinit capture. This figure depicts an assay done with a 1 repeat unit reporter, therefore 
one can determine that the target must contain 8 repeat units since at the concordant site, the 
number of repeat units in the capture (7) plxis the number of repeat units in the reporter (1) 
equals 8. The depiction relates to the diagram of Fig. 5 in that it shows the results attained 
5 in the analysis of a DNA sample containing an eight repeat unit target with a one repeat unit 
reporter. 

Figs. 7A through 7G are diagrammatic depictions of a multiplex system, such as a 
sandwich assay, in which the reporter includes two repeat imits. This is in distinction to the 
assay of Figs. 5 A-G wherein the reporter included a single repeat unit. Again, for expository 

10 convenience, the numbering of earlier figures will be adopted to the extent of similarity. Fig. 
7A shows a target 30 having a first imique flanking region 32 and a second xmique flanking 
region 34. The reporter 40 includes a imique sequence region 42 and, in this example, two 
repeat imits 44. The capture sequence 50 includes a capture imique sequence region 52 and 
4 repeat units 54. Again, as a matter of nomenclature, the capture 50 may be referred to as 

15 a "5 capture", reflecting the terminology utilized in connection with the assay of Figs. 5 A-5G, 
namely those in which a reporter 40 having a single repeat unit 44 is utilized. 

Fig. 7B shows a multiplex system wherein the capture 50 includes 5 base imits 54. 
Each of the examples of Fig. 7A and 7B include a gap 56. 

Fig. 7C shows a match condition in that the number of repeat units 36 in the target 30 

20 is equal to the sum of the number of repeat units 54 in the capture 50 plus the repeat units 44 
in the reporter 40 (6+2=8). Fig. 7C depicts the concordant test site in that the match condition 
exists. Note that the efifective location of the concordant test site has shifted from Fig. 5D to 
Fig. 7C. This is a reflection of the change in the number of repeat units 44 in the reporter 40. 
Where a sequence or flight of unit incrementally longer captures 50 are utilized, a reporter 

25 40 being one base unit 44 shorter or longer will shift the physical location of the concordant 
test site, such as in Fig. 6 from test site 26D to 26C when going from a reporter of one repeat 
unit 44 to a reporter 40 having two repeat units 44. 

Figs. 7B-7G show an overlap condition. In Fig. 7D, one repeat unit 44 of the reporter 
40 is in a hybridized relationship with a repeat unit 36 of the target 30. A second repeat unit 

30 44* of the reporter 40 is in a mismatched, overlap condition with the complex. Fig, 7E shows 
an overlap condition wherein repeat units 60 are in a non-hybridized condition with the target 
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30. Fig. 7F and 7G also include mismatch arrangements including overlap wherein multiple 
repeat units 60 are in a non-hybridized condition with the target 30. The mismatch repeat 
units may either be those from the reporter 30 or from the capture 50, or a combination of 
both. 

5 Fig. 8 shows a plan view of an array of test sites for xise in a multiplex assay, such as 

a sandwich assay. The concordant test site is determined to be at the site containing the 6 
repeat unit capture. This assay depicts an assay done with a 2 repeat unit reporter, therefore 
one can determine that the target must contain 8 repeat units since at the concordant site, the 
number of repeat units in the capture (6) plus the nimiber of repeat xmits in the reporter (2) 

10 equals 8. The depiction relates to the diagram of Fig. 7A-7G in that it shows the results 
attained in the analysis of a DNA sample containing an eight repeat imit target with a two 
repeat unit reporter. In comparison with Fig. 6, one notes the change in the concordant test 
site location and confirmation of the target allele determination, when the target DNA is 
redundantly assayed with a second reporter oligonucleotide. 

15 Fig. 9A and Fig. 9B show diagrammatic views of a loopout embodiment of the 

invention. In the figures, a capture 60 includes a first unique flanking sequence 62, a second 
unique flanking sequence 64 and an intervening sequence of repeat imits 66 comprising one 
or more repeat units. A reporter 70 includes a reporter first unique flanking sequence 72, a 
reporter second unique flanking sequence 74 and an intervening sequence of repeat units 76. 

20 The number of repeat imits in the intervening sequence of repeat units 76 of the reporter 70 
may be the same as or different than the number of repeat units in the intervening sequence 
of repeat units 66 of the capture 60. A reporter label 78 may be included. 

Figs. lOA through 1 OG are depictions of multiplex systems having a variable length 
of repeat units. The numbering for Figs. 7-10 will adopt that from Figs. 9A and 9B to the 

25 extent practicable. In Fig. lOA, a capture 60 mcludes a first unique flanking sequence 62, a 
second unique flanking sequence 64 and an intervening sequence of repeat imits 66 having 
5 repeat units. The reporter 70 includes a reporter first unique flanking sequence 72, a 
reporter second unique flanking sequence 74 and a intervening sequence of repeat units, 
specifically, 7 repeat units. A mismatch or loopout condition exists given the different 

30 number of repeat units in the mtervening sequences 66, 76. Similarly, in Figs, lOB and lOD- 
lOG, a mismatch or loopout condition exists. Each of the component figures includes 7 base 
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repeat units in the reporter 70. A flight or sequence of monitonically increasmg number of 
repeat units in the intervening sequence of repeat units 66 for the capture 60 is depicted. Fig. 
lOB includes 6 repeat units, which still results with a mismatch, loopout condition in Fig. 1 OB 
where the excess repeat units in the intervening sequence 76 of the reporter 70 are looped out. 
5 In each of Figs. 1 OD-1 OG, the number or repeat units in the intervening sequence 66 in the 
capture 60 exceeds the number of repeat units in the intervening sequence 76 of the reporter. 
A loopout or mismatch condition then exists. In Fig. lOD, 8 repeat units exist within the 
intervening sequence of repeat units 64, which differs by one repeat unit from the number 
within the reporter intervening sequence of repeat units 76. In Fig. lOE, there are 9 repeat 

10 units in the intervening sequence of repeat units 64 in the capture 60. In Fig. lOF, there are 
1 0 repeat imits in the intervening sequence of repeat units 64 of the capture 60. In Fig. 1 OG 
there are 11 repeat \mits in the intervening sequence of repeat units 64 of the capture 60. 

In one mode, the hybridization complex is labeled and the step of determining 
concordance and discordance includes detecting of the amounts of labeled hybridization 

1 5 complex at the test sites. The detection device and method may include, but is not limited to: 
optical imaging, electronic imaging, imaging with a CCD camera and integrated optical 
imaging. Further, the detection, either labeled or unlabeled, is quantified, which may include 
statistical analysis. The labeled portion of the complex may be the: target, capture, reporter 
or the hybridization complex in toto. Labeling may be by fluorescent labeling selected from 

20 the group of but not limited to: Bodipy Texas Red, Bodipy Far Red, Lucifer Yellow, Bodipy 
630/650-X, Bodipy R6G-X and 5-CR 6G . Labeling may further be done by colormetric 
labeling, bioluminescent labeling and/or chemiluminescent labeling. Labeling further may 
include energy transfer between molecules m the hybridization complex by: perturbation 
analysis, quenching, electron transport between donor and acceptor molecules, the latter of 

25 which may be facilited by double stranded match hybridization complexes (See, e.g., Tom 
Meade and Faiz Kayyem, electron transport through DNA). Optionally, if the hybridization 
complex is unlabeled, detection may be accomplished by measurement of conductance 
differential between double stranded and non double stranded DNA (See, e.g., Tom Meade 
and Faiz Kayyem, electron transport through DNA). Further, direct detection may be 

30 achieved by porous silicon-based optical interferometry. 



wo 99/43853 PCT/US99/03175 

25 

The label may be amplified, and may include for example branched or dendritic DNA. 
If the target DNA is purified, it may be unamphfied or amplified. Further, if the purified 
target is amplified and the amplification is an exponential method, it my be, for example, 
PGR amplified DNA or SDA amplified DNA. Linear methods of DNA amplification such 
5 as rolling circle or transcriptional runoff may be used. Wherein target DNA is unpurified and 
unamplified or amplified, the amplification methods fiirther consisting of PGR and SDA for 
exponential amplification and rolling circle or transcriptional runoff for linear amplification. 

The target DNA may be from a source of tissue including but not limited to: hair, blood, 
skin, sputum fecal matter, semen, epithelial cells, endothelial cells, lymphocytes, red blood 
10 cells, crime scene evidence. The source of target DNA may include: normal tissue, diseased 
tissue, tumor tissue, plant material, animal material, mammals, humans, birds, fish, microbial 
material, xenobiotic material, viral material, bacterial material, and protozoan material. 

Wherein the target material is from cloned organisms (Ian Wilmut, Roslyn Institute, 
Edinborough) to determine degree of identity and level of genetic drift. 
15 Further, the source of the target material may include RNA. Further yet, the source of 

the target material may include mitochondrial DNA. 

EXAMPLES 

Example 1 - Identification of THOl Target DNA Alleles by the Sandwich Hybridiza 
20 tion Method 

The THOl locus contains the tetranucleotide repeat (AATG) present in five to eleven 
copy-numbers in a noncoding region of the Human Tyrosine Hydroxylase gene (ref). This 
locus is one of many commonly used and accepted by the forensics community for DNA 
fingerprinting. Figure 1 depicts data from an experiment designed to determine the identities 
25 of the alleles present in an unknown target DNA sample after analysis by the method 
described here. 

A silicon chip was prepared by spin coating onto the top of the electrodes an organic 
layer of agarose mixed with streptavidin, thus forming the permeation layer that serves as the 
underlying foundation for DNA attachment (See e.g., U.S. Application Serial No. 08/271,882, 
30 filed July 7, 1994, entitled "Methods for Electronic Stringency Control for Molecular 
Biological Analysis and Diagnostics", accord., Sosnowski et al., 1997, Proceedings of the 
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National Academy of Science USA). This peraieation layer is contiguous witii the electrode 
on one side and contiguous with the buffer containing the analyte on the other. Capture DNA 
specific for each THOl allele was then electronically addressed to individual sites on the spin 
coated chip, so that each test site is capable of detecting a different THOl allele. Sequences 
5 for the capture oligos are listed in Table 1 . The capture oligos were electronically addressed 
in 50 mM histidine buffer at its natural pH -5.4, at a concentration of 500 nM. Pads were 
biased positive 5 at a time and a current source of + 4.0 microamps ( A) was applied for 38 
milliseconds (ms). The polarity of the field was then reversed and - 4.0 A was applied to the 
5 pads for 25 ms. This cycle was repeated 500 times for a total electronic addressing time of 
10 --30 seconds. Under these conditions, the biotin moiety of the capture oligo reacts with the 
streptavidin in the permeation layer over the activated test site to immobilize the capture oligo 
at that site. 

A mix of complementary target DNA, composed of THOl alleles 5 and 9 (Table 1), was 
then electronically hybridized to each of the sites containing addressed capture DNA. The 

15 electronic hybridization was done in low conductivity zwitterionic buffer at a temperature 
empirically determined to promote nonslippage hybridization. Due to the nature of electronic 
hybridization, specifically the low conductivity buflFer (Edman, et al.. Nucleic Acids Research, 
1997), high stringency hybridization can be attained at lower temperatures than conventional 
nonelectronic hybridization. Experiments for THOl analysis were usually performed at 34- 

20 42% C. The target DNA was electronically hybridized in 50 mM histidine buffer at its 
natural pH --5.4, at a concentration of 5-125 nM. The programmed electronic protocol 
included the following steps. Pads were biased positive 5 at a time and a current source of + 
4.0 microamps ( A) was applied for 19 milliseconds (ms). The polarity of the field was then 
reversed and - 4.0 A was applied to the same 5 pads for 12 ms. This biased-AC cycle was 

25 repeated 500 times for a total electronic addressing time of --30 seconds. 

This experiment has also been done by passive, nonelectronic hybridization at high 
stringency conditions, but with much longer incubation times (50 mM NaP04 buffer, pH 7.0, 
60% C, 30-60 minutes, results not shown). 

The stringency of this hybridization step is critical due to the malleable nature of the 

30 tetranucleotide repeat complementary alignment. It is quite easy to obtain stable hybrids 
without aligning the flanking unique sequences, since the length of the repeat region is 20-44 
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bases. The out of register hybrids formed by insufficient stringency will not be accurately 
distinguished by any hybridization assay. High stringency hybridization can be attained at 
relatively low temperatures with electronic hybridization because of the low conductivity of 
the buffer and resulting low shielding of the repulsive negative charges on the DNA 
5 backbone. Electronic concentration of DNA overcomes these repulsive eflfects while 
maintaining highly stringent hybridization conditions. 

Reporter DNA, 1 Repeat Unit ( Fig. 13, 500 nM in 50 mM NaP04, 500 mM NaCl, pH 
7.0) was then passively hybridized to the capture-target complexes formed by the above steps. 
Reporter hybridization was most stable at those test sites where the target directs 

1 0 hybridization to provide a juxtaposition of the terminal nucleotides of the capture and reporter 
oligo. This additional stability is due to the base stacking of the terminal nucleotides. This 
juxtaposition will be 5*-3' or 3'-5' depending on the position of the attachment chemistry on 
the capture oligo. Unstable configurations would be a four base (or greater) gap between the 
capture and reporter or a four base (or greater) overlap of the capture and reporter ( See, e.g., 

15 Figs. 3A-3C, and 5A-5G). 

After reporter hybridization, the DNA loaded chip is washed several times with 50 mM 
NaP04, pH 7.0 at ambient temperature. The temperature of the hybridized organo-chip is then 
increased to 30% C and fluorescence levels at each test site are recorded at one minute time 
intervals. The fluorescent values are digitized by a computer program (IP Lab) as mean pixel 

20 intensity. A specific area over each pad is selected, and the pixel intensity for each site is 
stored for analysis. The histogram in Figure 1 displays the mean pixel intensity at each test 
site immediately after the denaturation step is complete. 

Fig. 1 1 shows a graph of the fluorescence as a fimction of number of repeat units. These 
results show that a heterozygous mix of THOl DNA can be resolved into match (concordant) 

25 and mismatch (discordant) hybrids, with the match hybrids representing the identity of the 
alleles present in the DNA sample. All possible homozygote and heterozygote THOl STR 
allelic combinations (5+6, 5+7, 5+8 etc.) have been analyzed by the chip format, such as 
shown in Figs. 1 A and IB, with similar excellent levels of discrimination among alleles. 
Example 2 - Reanalysis of Target DNA with Redundant Reporters 

30 One Repeat Unit reporter was denatured from the match sites of the chip described in 

the preceding example by increasing the temperature ^-50% C, conditions which do not 
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denature the target from the capture. This chip was then rehybridized with the Zero Repeat 
Unit reporter ( See, e.g.. Fig. 5). This shifts the position of stable sandwich complexes from 
the 4 and 5 sites (See, Figure 12, left hand side, One Repeat Unit Reporter) to the 5 and 6 sites 
(Figure 1 2, right hand side, Zero Repeat Unit Reporter). Using the formula that the number 
5 of repeat units in the capture plus the number of repeat units in the reporter equals the number 
of repeat xmits in the target, we find that the target DNA in this case had a heterozygous mix 
of the 5 and 6 alleles of THOl. The reanalysis confirms the identity of the alleles present in 
the target DNA with a second oligo sequence. This redimdant analysis increases the 
significance of the assay result since it is essentially a new interrogation of the target DNA 
1 0 with an oUgo that has a different sequence. Using a different sequence reduces the possibility 
of artifactual results due to oligo secondary structure or other sequence-related anomalies. 
Therefore, the use of additional oligos for target analysis reduces the possibility of false 
positive and/or negative results. 

The above protocol was repeated with the Two Repeat Unit reporter ( See, e.g., Figs. 
15 7A-7G), to shift the location of stable match hybrids to a third test site. This additional 
reiteration of the STR analysis ftulher strengthens the robustness of the assay. 

Example 3 - Selection and/or Modifications of Terminal Nucleotides to 

Increase Base Stacking Effect 

Base stacking is dependent on the interactions of the ring structure of one base with the 
20 base ring of its nearest neighbor. The strength of this interaction depends on the type of rings 
involved, as detemained empirically. While applicants do not wish to be bound by any theory, 
among the possible theoretical explanations for this phenemon are the number electrons 
available between the two bases to participate in pi bond interactions and the efficiency of 
different base combinations to exclude water from the interior of the helix, thereby increasiug 
25 entropy. Although the above models are consistent with current data, the possible mechanisms 
of stacking interactions are not limited to these concepts. 

It has also been observed that modification of bases involved in base stacking 
interactions can strengthen pi bonding, or stacking, between them. As one might predict from 
the models described above, these modifications provide more electrons for use in Pi bonding 
30 and/or to increase the surface area of the rings thereby increasing the area of hydrophobicity 
between the stacked bases. 
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Figure 14 demonstrates an example of these models as applied to this invention. Initial 
experiments with the CSFIPO locus used an A and T astheterminal nucleotides to provide 
discriminating base stacking. References indicate that A-T base stacking interactions are the 
least stable of all nucleotide combinations. Therefore we altered the design of the capture and 
5 reporter oligos to make G and A the terminal nucleotides, since this is reported to be a much 
more stable conformation. The experiment was done by the method described in Example 1 , 
with the exception that the locus examined was CSFIPO. To compare the base stacking 
contributions of different juxtaposed contiguous terminal nucleotides, an additional set of 
CSFIPO capture and reporter were designed to change the terminal nucleotides from T-A to 

10 G-A. Figure 15 compares discrimination of match from mismatch hybrids containing either 
A-T or G-A terminal nucleotides. The results are displayed as discrimination ratios, that is 
the Mean Fluorescent Intensity (MFI) of the concordant site divided by the average MFI of 
the discordant sites. One sees that the Discrimination Ratios mcrease from about 2.5 to about 
25 when G-A terminal nucleotides are used rather than T-A terminal nucleotides. These data 

1 5 demonstrate that this system can be modulated in a manner predicted by base stacking theory, 
as well as earlier observations, thereby underscoring the mechanism of the invention as 
dependent on Pi bonding between juxtaposed bases. 

In addition to taking advantage of the naturally selected base stacking interactions, it 
may be predicted that base modifications which increase the number of electrons in the ring 

20 or enlarge the hybdrophobic area woidd also increase discrimination of match from mismatch 
hybrids. This was tested by synthesizing THOl reporter oligos whose 5* terminal nucleotide 
contained a propynyl group attached to the ring of the base. This modification would be 
predicted to increase base stacking by either of the increased electronorhydrophobicity models 
described above. Figure 15 shows match/mismatch results in a direct comparison of a THOl 

25 reporter with or without a propynyl-modified terminal base. This experiment was done as 
described in Example 1 with the THOl locus. The data are again presented as discrimination 
ratios. In 4 separate experiments, enhanced stability is observed in complexes containing 
reporters with propynyl-modified reporters. The average increase in discrimination ratios was 
95%. The results show that again, this system can be manipulated in a predictable fashion. 

30 This concept could be carried further by adding other analogs such as methyl of cholesterol 
groups. Techniques for adding these types of modifications are known (e.g. Gryaznov). 
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These modifications could be used to further stabilize the binding of the reporter to the 
concordant by linking the modifying molecules together. One example of this is taught by 
Gryaznov in the use of cholesterol at both terminal nuclei and the addition of a cholesterol 
bindmg molecule, such as low density lipoprotein (LDL). This would result in a complex at 
5 the concordant site which consists of target, cholesterol-modified capture, cholesterol- 
modified reporter and LDL. 

Example 4 - Hybridization Detection of TPOX alleles 

The TPOX locus contains the tetranucleotide repeat (AATG) present in six to thirteen 
copy-numbers in a noncoding region of the Human Thyroid Peroxidase gene (See, e.g., 

10 Anbach et al., 1996, Advanced in Forensic Haemogenetics). This locus is also one of many 
commonly used and accepted by the forensics community for DNA fingerprinting. The 
sequences are provided in Fig. 17. 

Figure 16 depicts data from an experiment where target DNA containing the TPOX 8 
and 1 1 alleles was analyzed by a procedure nearly identical to that described in Example 1, 

15 OUgo capture DNA containing all allelic possibilities was electronically addressed to 
individual sites on the chip. A nux of complementary target DNA, composed of TPOX alleles 
with 8 and 1 1 STRs, was then electronically hybridized to each of the pads containing 
addressed capture DNA. The conditions for electronic hybridization were the same as those 
outlined in Example 1 . One Repeat Unit Reporter oligo was then passively hybridized to the 

20 array and treated in the manner of the THOl example. Figure 16 shows a stable hybridization 
complex at the test sites containing capture oligos with 7 and 10 repeat units. Since the 
reporter oligo has one repeat unit, the target DNA can be identified as having 8 and 1 1 repeat 
units. 

The results show that a mix of TPOX 8 and 1 1 STR DNA can be unequivocally 
25 discriminated firom all mismatches. Further, all other homozygous and heterologous TPOX 
combinations analyzed yield compamble discrimination. 

Example 5 - Hybridization discrimination of CSFIPO alleles 
Capture oligos containing CSFIPO alleles 7 through 15 (Fig. 19) inclusive were 
30 electronically addressed to representative sites as previously described. Target DNA 
containing CSFIPO 1 1 and 12 alleles (Fig. 19) was then electronically hybridized to each of 
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the sites. The CSFIPO one repeat unit reporter ( Fig, 19) was then. Denaturation of the 
reporter was done at 30% C. Figure 1 8 shows the mean pixel intensity at various capture sites 
after the assay, demonstrating the ability of the assay to correctly discriminate the alleles 
present in the target sample. The experiment was done as described in Example 1. 
5 Example 6 - THOl/TPOX Multiplex Analysis 

Locus-allele specific capture oligos were individually addressed to different sites on a 
single chip. The DNA chip containing capture oligos was then hybridized with a mixture of 
THOl and TPOX target DNA containing heterozygote alleles. The chip was then washed and 
analyzed by the hybridization assay of the form described previously. The above steps were 

10 performed as described in Example 1. Relative fluorescent levels were xised to determine 
whether sites contain concordant or discordant DNA hybrid complexes. Both reporters used 
contained one repeat unit. 

The results of Fig. 20 showed that under the assay conditions, 7 and 9 STR alleles of 
THOl hybridized very well with their cognitive capture sites. Hybridization to other capture 

15 alleles was not detectable (5x, 7x, 9x and lOx), indicating an excellent discrimination of 
THO 1 7/9 heterozygote. For the TPOX locus, we also obtained a good matched capture/target 
interaction (sites 9x and 1 Ix). Further, the stability of the discordant hybridization complexes 
formed with the 10 and 12 STR targets was so low that the complexes were either 
undetectable (7xc and 12xc), or low enough to yield a discrimination ratio of 15 fold or higher 

20 (lOxc and 8xc respectively), resulting in easy discrimination of TPOX 10/12 heterozygote 
target. 



Example 7 - Identification of STR Alleles in Double Stranded PCR-amplified DNA 
25 This experiment was done to determine the utility of the current invention as applied to 

interrogation of double-stranded DNA generated by PGR amplification. Figure 6 provides 
an example of the ability of our system to accurately identify PGR generated targets. 

The TPOX 1 locus was PGR amplified xising a genomic template from the K562 cell line 
following standard conditions outlined in the Promega STR User's manual (3). The genotype 
30 of K562 is heterozygous for the 8 and 9 repeat alleles. Following amplification the amplicon 
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was denatured at 95% C and hybridized to a Nanogen APEX chip. As previously discussed, 
the chip had capture probes unique for PGR products containing each number of repeat length. 

The technical aspects of this experiment were identical to those described in Example 
I and 4, with the exception of the use of double-stranded, PGR amplified DNA as the target. 
5 Figure 21 shows the relative amount of signal present on the positive (8G, 9G match) 

and negative (7G, IOC mismatch) after the experiment has been performed. As seen in 
example 4 the level of discrimination attainable ranges fi'om 20-fold to infinite. Similar 
results have been obtained using CSFl and THOl firom both K562 control DNA and genomic 
DNA isolated firom anonymous donors. These results suggest that the current invention is 
10 generally applicable to all double-stranded DNA, whether amplified by PGR or other 
technology, including the potential for analysis of xmamplified DNA. 

Examples - Multivariant Detection 

In the examples listed previously, detection has been accomplished by direct fluorescent 
labeling of the reporter or reporter/target DNA. One embodiment would be fluorescence 

1 5 perturbation where quencher and reporter chromophores are positioned proximal to each other 
such that fluorescence is quenched. See, e.g., (Methods for Hybridization Analysis Utilizing 
Electrically Controlled Hybridization and Methods For Electronic Fluorescent Perturbation 
for Analysis and Electronic Perturbation Catalysis for Synthesis), all incorporated by 
reference as if fiilly set forth herein. 

20 Oligo synthesis and conjugation methods and materials are commonly practiced. In 

brief, the capture probe would have attachment moiety, such as biotin, at one end and a 
chromophore at the distal terminus or the terminus which extends into the STR region. 
During DNA synthesis linker arms or spacers would be incorporated at the appropriate 
location internally or at the terminus. These linker arms would have a fimctional group to 

25 which chromophores could later be conjugated, such as amino linker and succinimidyl ester 
chromophore. The reporter probe would have a different chromophore incorporated in the 
same manner, at the end which extends into the STR region. Thus, in the presence of target 
the capture and reporter probes would hybridize and position the chromophores proximal to 
one another. The distance between the chromophores would be determined by the spacer 

30 length and where the chromophore was attached to the DNA, via the base, backbone, or sugar. 
Example 9 - Target -Dependent Ligation of Capture and Reporter 



wo 99/43853 PCT/US99/03175 

33 

An additional embodiment of this invention would be to further stabilize the attachment 
of the reporter to the concordant test site by ligating the reporter to the preattached capture. 
This would result in a covalent bond between the capture and the reporter with the capture 
being held at the site by biotin-streptavidin interaction. The critical part of this embodiment 
5 would be to accomplish the ligation in a selective manner, maintaining the ability to 
discriminate match from mismatch hybrids. This could be done by careful maintenance of 
hybridization stringency, by electronic or conventional methods. 

Ligation could be achieved enzymatic (Maniatis et al., Molecular Cloning, a Laboratory 
Manual, 1982) or chemical methods (Gryaznov, Nucleic Acids Research, 1994). Selection 
10 of the method could be determined by the kinetics involved with the specific type of reaction 
as well as the overall efficiency of taking a particular method into a product. 



Example 10 - Discriminating Match from Mismatch/Gap and Mismatch/Overlap 
1 5 The ability of the system to discrimmate not only match from mismatch hybrids, but also 

to discern between the two types of mismatches, gap and overlap increases the utility of the 
method. Figs. 22A, 22B and 22C show graphics of fluorescence intensity for a gap, match 
and overlap condition (bar charts from left to right), for the initial signal (Fig. 22A) after three 
minutes of denaturation (Fig. 22B) and after ten minutes of denaturation (Fig. 22C). This 
20 feature of the technology provides additional information about the target DNA, that is, 
information regarding all three types of possible hybrids. This additional information can be 
used in several ways. 

First, it may be possible to reduce the number of pads required for accurate identification 
of target DNA by taking advantage of this feature. Fig. 23 shows the potential for this feature 

25 in use with the THOl locus. It predicts that correct identification of all THOl alleles can be 
achieved with a set of capture oligos which have five, seven and nine repeat units in 
combination with the one repeat unit reporter THOl locus. This reduces the reqxiired number 
of analytical test sites from seven to three. This feature, when combined with the ability to 
do redundant reporters, could greatly reduce the number of pads required for the analysis of 

30 a set of loci for statistically significant genotyping. Currently this level is approximately 10 
loci. The beneficial effect of this would be to permit more loci on a single chip, and therefore 
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with larger arrays, the ability to assay mxiltiple individuals on the same chip thereby reducing 
the cost of the assay. This would be especially useful for high throughput processing, as will 
be requured for the STR databasing of felons currently under way. 

Even without a reduction in the number of test sites needed to assay STR alleles, the 
5 additional information gained from distinguishing gap mismatch from overlap mismatch will 
aid in the accuracy of the assay. Any additional information could be incorporated into the 
ultimate statistical analysis of the data to provide an answer which has a higher probability 
of being accurate. 

Example 11 - Determination of STR Allelic Identity of Target DNA by Loopout 
10 Analysis 

In another embodiment of STR allele discrimination by hybridization, we have 
demonstrated that a different oligonucleotide system can be used. This method, designated 
the loopout system, is outlined in Figure 9A and 9B, and Figs. lOA-lOG. It is evident from 
the drawing that this is an alternative to the sandwich method of identifying alleles in a 

1 S matrix. The loopout system uses an array of capture oligos which are distributed in a similar 
manner to the sandwich format. The structure of the capture oligos differs from ones in the 
sandwich format by the presence of locus-specific unique sequence flanking both ends of the 
repeat region. Also, the target DNA is labeled and serves as the reporter molecule. The target 
can be labeled during amplification by using a PGR primer which is fluorescent (or contains 

20 any other suitable molecular adaptation for detection). 

In practice, loopout capture oligos which are specific to different alleles of a locus are 
arrayed in a matrix such that individual test sites represent different alleles (See, e.g., Figure 
6). Actual test results are shown in Fig. 24. Labeled target oligo is then stringently 
hybridized under electronic conditions to the entire array. The test site with the most stable 

25 hybrids will be concordant with the allelic identity of the target DNA. Determining the 
position of the stable hybrids (and therefore the allele-specific capture oligo attached to it) 
identifies the alleles represent in the target DNA. Hybrids formed between capture and 
reporter oUgos vMch have the same number of repeat units are matched. It may also be said 
that this test site is concordant with the identity of the target allele. Test sites which are 

30 discordant or contain an unequal number of repeat units between the capture and the target, 
will form a hybrid with a loop in either the target or capture DNA (See, e.g.. Figs. lOA, lOB 
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and lOD-lOG). Discordant sites which have captures with fewer repeat units than the target 
will yield hybrids with loops in the These hybrids will be inherently less stable than the 
match hybrids. Therefore denaturation by electronic stringency control will discriminate 
stable from less stable hybrids indicating the sites of concordancy, and based upon the 
5 knowledge of the probes present at those sites, enable the user to determine the number of 
repeat units in the target DNA, 

Example 12 - Detection of Microvariant Allele THOl 93 

As STR's become more widely used, deviations within the repeat regions are being 
discovered with greater frequency. This can be troublesome to conventional size 
10 fractionation methods since the margin for discrimination goes from four bases down to one 
base. This is in the case of an insertion or deletion mutations. For transitional or 
transversional mutations, the size of the allele doesn't change, even though the sequence has 
become altered. 

Both these classes of mutations, insertion/deletion and transitional/transversional can be 
1 5 readily detected by our technology. This is primarily due to the fact that Nanogen's approach 
is a hybridization based assay rather than a sizing method. Therefore combining terminal 
nucleotide base stacking with single nucleotide polymorphism provides a powerful 
discriminatory tool. 

One well known STR microvariant is the THOl 9.3 allele. It is important because it is 
20 present in a significant portion of the Caucasian popxilation. The assay for the 9.3 
microvariant was essentially the same as the normal STR alleles but required special capture 
and reporter oligo design. The capture oligo contains only three repeat imits (3ru, Fig. 13). 
This is because the single base deletion is between repeat units 6 and 3 on the target strand, 
THOl 9.3 target DNA binding to the capture will be less stable at capture sites containing 
25 greater than three repeat units because there will be a frame-shift in the repeat region of 9.3. 
The reporter oligo (Microvar 9.3, Fig. 13) has been designed so that it will bind most stably 
to the repeat unit region containing the deletion. Additionally, the capture oligo has been 
designed so that target directed base stacking of the capture and reporter DNA will occur only 
at the 3ru test site. Figure 25 shows a detailed sequence alignment of the capture and reporter 
30 oligonucleotides used to detect the THOl 9.3 microvariant The numbering in the drawing is 
consistent with that found in Figure 4C, with the addition of sequence 45 which is 
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complementary to the partial repeat unit which constitutes the microvariant. This particular 
microvariant reporter has no second unique flanking sequence 42. This is necessary for 
analysis of THOl 9,3 allele not be a feature of other microvariant-specific reporters. It is 
evident from this drawing in the hybridization complex which is concordant for the THOl 9.3 
5 allele, there exists sequence complementarity between the target and reporter DNA as well 
as base stacking between the capture and reporter oligonucleotides. Figure 26 shows the 
results of an assay of PCR amplified DNA from an individual homozygote for the THOl 
allele. 

Although the foregoing invention has been described in some detail by way of 
1 0 illustration and example for purposes of clarity and understanding, it may be readily apparent 
to those of ordinary skill in the art in light of the teachings of this invention that certain 
changes and modifications may be made thereto without departing from the spirit or scope of 
the appended claims. 



wo 99/43853 PCT/US99/03175 

37 

We Claim: 

1. A method for determining the nature of repeat imits in a genetic target, 
comprising the steps of: 

providing a plurality of hybridization complex assays arrayed on a plurality of 
5 test sites, where the hybridization complex assay includes at least: 

a nucleic acid target containing repetitive DNA sequence, 
a capture probe having a first unique flanking sequence and n repeat 
units, where n > 0, complementary to the target sequence, and 

a reporter probe having a selected sequence complementary to the same 
10 target sequence strand, the reporter including attributes selected firom the 

following group: 

a) wherein the selected sequence of the reporter includes a 
second unique flanking sequence, 

b) wherein the selected sequence of the reporter includes 
1 5 sequence complementary to a microvariant region of the 

target sequence, and 

c) wherein the selected sequence of the reporter includes a 
second unique flanking sequence complementary to a 
microvariant region of the target sequence, 

20 wherein the sxmi of repeat units in capture plus reporter is greater than 

zero, 

the sequence of the capture probe differing at at least two test sites, 
determining concordance and discordance among the hybridization complex 
assays at the test sites as determined at least in part by hybridization stability and 
25 determining the nature of the repeat units in the target sequences based upon 

determination of the concordant site and further based upon the knowledge of the probes 
located in the hybridization complex at that site. 
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2. The method of claim 1 wherein the location of the concordant test site represents 
the nature of the target sequence repeat units by the number of repeat units present in the 
target, based upon the knowledge of the probes located at that test site. 

5 3 The method of claim 2 wherein at the concordant test site, the munber of repeat 

units in the target eqxials the sum of the number of repeat xmits in the capture probe and the 
number of repeat units in the reporter probe. 

4. The method of claim 1 wherein the location of the concordant test site represents 
10 the nature of the target sequence repeat units by the detection of microvariants in the target 

sequence 

5. The method of claim 4 wherein the microvariants include one or more selected 
from the group comprising: deletions, insertions, transitions and transversions. 

15 

6. The method of claim 5 wherein the microvariant affects a single base. 

7. The method of claim 5 wherein the microvariant affects more than a single base. 

20 8. The method of claim 1 wherein the location of the concordant test site represents 

the nature of the target sequence repeat units by the number of repeat miits and by the 
detection of microvariants in the target sequence. 

9. The method of claim 8 wherein the microvariants include one or more selected 
25 from the group comprising: deletions, insertions, transitions and transversions. 

1 0. The method of claim 9 wherein the microvariant affects a single base. 



1 1 , The method of claim 9 wherein the microvariant affects more than a single base. 

30 



wo 99/43853 PCT/US99/03175 

39 

12. The method of claim 1 wherein a test site is deemed to be Concordant when the 
hybridization complex present there is stable relative to other test sites for the same locus. 

13. The method of claim 12 wherein stability is enhanced by complementary match 
5 of bases in the hybridization complex includmg the capture, reporter and target. 

14. The method of claim 12 wherein stability is enhanced by juxtaposed terminal 
nucleotides of reporter and capture being contiguous to permit base stacking. 

10 15. The method of claim 14 further including selection of juxtaposed terminal 

nucleotides to increase energy difference between concordance and discordance. 

16. The method of claim 15 wherein the selection keyed at least in part to base 
stacking. 

15 

17. The method of claim 16 wherein the base stacked pair is 5'GpC3'. 

1 8. The method of claim 1 6 wherein the base stacked pair is 5TpA3'. 

20 19. The method of claim 1 2 further including modifications of terminal nucleotides 

which increase base stacking. 

20. The method of claim 19 wherein the terminal nucleotide is modified with 
propynyl groups. 

25 

2 1 . The method of claim 1 9 wherein the terminal nucleotide is modified with melhy 1 
groups. 



22. The method of claim 19 wherein the terminal nucleotide is modified with 
30 cholesterol groups. 
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23 . The method of claim 12 wherein stability is enhanced with ligation techniques. 

24. The method of claim 23 wherein the ligation is enzymatic ligation. 

5 25. The method of claim 23 wherein the ligation is chemical ligation. 

26. The method of claim 1 wherein discordance includes a gap between the capture 
and the reporter. 

1 0 27. The method of claim 1 wherein the discordance includes an overlap between the 

capture and the reporter. 

28. The method of claim 1 wherein discordance includes a base variation in the 
repeat region. 

15 

29. The method of claim 28 wherein the base variation includes a deletion. 

30. The method of claim 28 wherein the base variation includes a insertion. 
20 31. The method of claim 28 wherein the base variation includes a transition. 

32. The method of claim 28 wherein the base variation includes a transversion. 

3 3 . The method of claim 1 wherein discordance includes a single nucleotide variant 
25 between the capture and the reporter or in the repeat region. 

34. The method of claim 1 wherein discordance includes base variations which are 
greater than a single nucleotide between the capture and the reporter or in the repeat 
region. 

30 
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35. The method of claim 1 wherein discordance includes base variation plus overlap 
between the capture and the reporter or in the repeat region. 

36. The method of claim 1 wherein discordance includes base variation plus gap 
5 between the capture and the reporter or in the repeat region. 

37. The method of claim 1 wherein the number of bases in the repeat \mit is 2-50. 

38. The method of claim 37 wherein the number of bases in the repeat unit is 3-6. 

10 

39 The method of claim 1 wherein the number of repeat xmits including 
microvariants is 5-50. 

15 40. The method of claim 1 wherein the number of repeat units including 

microvariants is 2-2500. 

4 1 . The method of claim 1 wherein the number of loci analyzed at one time is one, 

20 42. The method of claim 1 wherein the number of loci analyzed at one time is three. 

43. The method of claim 1 wherein the number of loci analyzed at one time is less 
than five. 

25 44. The method of claim 1 wherein the number of loci analyzed at one time is less 

than ten. 

45. The method of claim 1 wherein the number of loci analyzed at one time is 
thirteen. 

30 



wo 99/43853 PCT/US99/03175 

42 

46. The method of claim 1 wherein the number of loci analyzed at one time is less 
than twenty. 

47. The method of claim 1 wherein the number of loci analyzed at one time is less 
5 than one hundred. 

48. The method of claim 1 wherein the hybridization stability is determined, at least 
in part, by electronic stringency (ESC) control. 

1 0 49. The method of claim 1 wherein the hybridization stability is determined, at least 

in part, by thermal regulation of stringency. 

50. The method of claim 1 wherein the hybridization stability is determined, at least 
in part, by chemical regulation of stringency. 

15 

5 1 . The method of claim 1 wherein the hybridization stability is determined, at least 
in part, by electronic stringency (ESC) and thermal control, 

52. The method of claim 1 v^iierem the hybridization stability is determined, at least 
20 in part, by electronic stringency (ESC) and chemical control. 

53 . The method of claim 1 wherein the hybridization stability is determined, at least 
in part, by electronic stringency (ESC), thermal and chemical control. 

25 54. The method of claim 1 further includmg electronic stringency conditions during 

the hybridizations of capture probe with the nucleic acid target. 

55. The method of claim 54 whereby the unique flanking sequence is hybridized in 
proper index with the target, 

30 
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56. The method of claim 54 whereby the electronic stringency conditions ensures 
proper indexing of the repeat units. 

57. The method of claim 54 whereby the electronic stringency conditions restricts 
5 slippage of the repeat units. 

58. The method of claim 54 wherein initial hybridization step occurs in 10 minutes 
or less. 

10 59. The method of claim 58 wherein initial hybridization step occurs in 5 minutes 

or less. 

60. The method of claim 54 wherein initial hybridization step occurs in 1 minute or 

less. 

15 

6 1 . The method of claim 54 wherem electronic stringency control includes electrical 
currents and buffers. 

62. The method of claim 1 further including stringent conditions during the 
20 hybridizations of capture probe with the nucleic acid target. 

63. The method of claim 62 wherein the stringent conditions include thermal 
regulation of stringency. 

25 64. The method of claim 62 stringency conditioning include thermal and electronic 

regulation of stringency. 

65. The method of claim 62 stringency conditionmg include chemical regulation of 
stringency. 

30 
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66. The method of claim 62 wherein the stringent conditions chemical and electronic 
regulation of stringency. 

67. The method of claim 62 wherein the stringent conditions chemical, thermal and 
5 electronic regulation of stringency 

68. The method of claim 1 further including electronic stringency conditions during 
the hybridization of the reporter probe with the capture probe nucleic acid target hybridization 
complex. 

10 

69. The method of claim 68 whereby the unique flanking sequence is hybridized in 
proper index with the target. 

70. The method of claim 1 whereby the electronic stringency conditions ensures 
1 5 proper indexing of the repeat units. 

71. The method of claim 1 whereby the electronic stringency conditions restricts 
slipp^e of repeat units. 

20 72. The method of claim 68 wherein initial hybridization step occurs in 10 minutes 

or less. 

73. The method of claim 68 wherein initial hybridization step occurs m 5 minutes 
or less. 

25 

74. The method of claim 68 wherein initial hybridization step occurs in 1 minute or 

less. 



75. The method of claim 68 wherein electronic stringency conditions includes 
30 electrical currents and biiflfers. 
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76. The method of claim 1 further includmg stringent conditions during the 
hybridizations of the reporter probe with the capture probe nucleic acid target hybridization 
complex. 

5 77. The method of claim 76 wherein the stringency includes thermal regulation of 

stringency. 

78. The method of claim 76 wherein the stringency includes themial and electronic 
regulation of stringency. 

10 

79. The method of claim 76 wherein the stringency includes chemical regulation of 
stringency. 

80. The method of claim 76 wherein the stringency includes chemical and electronic 
1 5 regulation of stringency. 

8 1 . The method of claim 76 wherein the stringency includes chemical, thermal and 
electronic regulation of stringency. 

20 82. The method of claim 1 wherein the hybridization complex is labeled. 

83. The method of claim 82 wherein the step of determining concordance and 
discordance includes detecting of the amounts of labeled hybridization complex at the test 
sites. 

25 

84. The method of claim 83 wherein the detecting is imaging. 

85. The method of claim 84 wherein the imaging is optical imaging. 



30 86. The method of claim 84 wherein the imaging is electronic imaging. 



10 
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87. The method of claim 84 wherein the imaging is CCD imaging. 

88. The method of claim 84 wherein the imaging is integrated optical imaging. 

89. The method of claim 83 wherein the imaging detection is quantified. 

90. The method of claim 1 further including a statistical analysis step. 

9 1 . The method of claim 82 wherein the labeled portion of complex is the target. 

92. The method of claim 82 wherein the labeled portion of complex is the capture. 

93. The method of claim 82 wherein the labeled portion of complex is the reporter. 



15 94. The method of claim 82 wherein the labeling is by fluorescent labeling. 

95 . The method of claim 94 wherein the fluorescent labeling is with Bodqy Texas 

Red. 

96. The method of claim 94 wherein the fluorescent labeling is with Bodipy 
20 630/650. 

97. The method of claim 94 wherein the fluorescent labeling is with Lucifer 
Yellow. 

25 98. The method of claim 82 wherein the labeling is by colormetric labeling. 

99. The method of claim 82 wherein the labeling is by chemiluminescent labeling. 

100. The method of claim 1 further including energy transfer between molecules in 
30 the hybridization complex. 
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101 . The method of claim 1 further including fluorescent perturbation analysis. 

1 02. The method of claim 1 00 wherein the energy transfer includes quenching. 

5 103. The method of claim 1 00 wherein the energy transfer includes electron transport 

between donor and acceptor molecules, which is fecilitated by double-stranded match 
hybridization complexes. 

104. The method of claim 103 wherein the transfer is mediated by electron 
1 0 conductance through double-stranded DN A. 

105. The method of claim 1 00 wherein the energy transfer includes electron transport 
which is facilitated by double-stranded match hybridization complexes. 

1 06. The method of claim 82 wherein the label is amplified. 

107. The method of claim 106 wherein the label further includes branched DNA. 

108. The method of claim 1 wherein the hybridization complex is imlabeled. 

109. The method of claim 108 wherein the detection of concordance is based at least 
in part on conduction of electrons along hybridized DNA. 

110. The method of claim 108 wherein the detection of concordance is based at least 
25 in part on mass detection. 

111. The method of claim 1 further including a redundant assay. 

112. The method of claim 1 1 1 wherein the redundant assay is a serial redundant assay, 

30 
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113. The method of claim 112 wherein the serial redimdant assay includes the 
following steps, after at least the step of determining concordance and discordance, 

increasing denaturation stringency to remove said reporter probe at all sites, 
whether concordant or discordant, 
5 hybridizing a second reporter probe where the number of repeat imits in said 

second reporter probe differs fix)m the number of repeat units in said reporter probe, 

wherein the location of the concordant test site indicates the number of repeat 
units present in the target, based upon the knowledge of the probes located at that test site, 
wherein at the concordant test site, the number of repeat units in the target equals 
10 the simi of the number of repeat number in the capture probe and the number of repeat units 
in the reporter probe, 

determining concordant and discordant test sites among the hybridization 
complex assays at the test sites as determined at least in part by hybridization stability, and 
comparing these results with the initial complex hybridization assay for 
1 5 confirmation of target repeat unit number. 

1 14. The method of claim 1 1 1 further including the step of repeating the redundant 
assay umtil a statistically significant result is obtained. 

20 115, The method of claim 1 1 1 wherein the redundant assay includes multiple arrays. 

1 1 6. The method of claim 115 further including the step of hybridizing target to at 
least two independent sets of assays, 

hybridizing said reporter to a first set and a second reporter to a second set, 
25 where the number of repeat imits in said reporter probe differs irom the number of repeat units 
in the second reporter, 

determining concordant and discordant test sites among the hybridization 
complex assays at the test sites as determined at least in part by hybridization stability, 
wherein the location of the concordant test site indicates the number of repeat units present 
30 in the target, based upon the knowledge of the probes located at that test site, and 



wo 99/43853 PCT/US99/03175 

49 

comparing these results with the two complexes hybridization assay for 
confirmation of target repeat unit number. 

117. The method of claim 116 including the step of simultaneous hybridization of 
5 differently labeled reporters. 

118. The method of claim 117 wherein two reporters differing in the number of repeat 
units in their nucleic acid sequence, and which are differentially labeled, includes the reporters 
then being simxiltaneously provided to the device. 

10 

119. The method of claim 117 wherein the differently labeled reporters are 
distinguishable chromophores. 

120. The method of claim 1 19 wherein the chromophores are fluorescent. 

15 

121 . The method of claim 1 19 wherein the chromophores are luminescent. 

122. The method of claim 1 19 wherein the chromophores are 
electrochemiluminescent. 

20 

123. The method of claim 119 wherein the chromophores include combinations of 
fluorescent, luminescent and electrochemiluminescent materials. 

124. The method of claim 117 wherein the location of the concordant test site 
25 indicates the number of repeat units present in the target, based upon the knowledge of the 

probes located at that test site. 

125 . The method of claim 117 wherein the detected presence of a first label at a first 
test site indicates the known nxraiber of repeat units associated with that reporter and capture 

30 sequence at the test site, which is confirmed by the detected presence of a second label at the 
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second test site which indicates the known number of repeat units associated with that reporter 
and capture sequence at the test site. 

1 26. The method of claim 115 wherein the two sets of arrays are on the same device. 

5 

1 27. The method of claim 1 1 5 wherein the two sets of arrays are on different devices. 

128. The method of claim 1 wherein the target DNA is purified. 
10 1 29. The method of claim 1 wherein the target is unamplified. 

130. The method of claim 1 wherein the target is amplified. 

131. The method of claim 130 wherein the amplification is an exponential methods 
1 5 of target DNA amplification. 

132. The method of claim 131 wherein the amplification includes PGR amplified 
DNA. 

20 133. The method of clahn 131 wherein the amplification includes strand displacement 

amplification (SDA) amplified DNA. 

134. The method of claim 1 3 1 wherein the amplification includes linear methods of 
DNA amplification. 

25 

135. The method of claim 134 wherein the amplification includes rolling circle 
amplification. 

136. The method of claim 134 wherein the amplification includes transcriptional run- 
30 off amplification. 
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137. The method of claim 1 wherein the target DNA is unpurified. 

138. The method of claim 137 wherein the target is nnamplified. 

139. The method of claim 137 wherein the target is amplified. 

5 

140. The method of claim 139 wherein the amplification is an exponential method of 
target DNA amplification. 

141. The method of claim 140 wherein the amplification includes PCR amplified 
10 DNA. 

142. The method of claim 140 wherein the amplification includes SDA amplified 
DNA. 

15 143. The method of claim 139 vviierem the amplification is a linear method of DNA 

amplification. 

144. The method of claim 143 wherein the amplification includes rolling circle 
amplification. 

20 

145. The method of claim 143 wiierein the amplification includes run-off 
amplification. 

146. The method of claim 1 wherein the plurality of hybridization complex assays 
25 arrayed at the test sites include at least one site for each allele of a locus. 

147. Hie method of claim 146 wherein the allele includes an integral number of repeat 

units. 



30 



148. The method of claim 146 \^iierein the allele mcludes an integral number of repeat 
units plus at least one microvariant. 
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1 49 . The method of claim 1 wherein each concordant test site identifies the repeat unit 
number of the target. 

5 150. The method of claim 1 wherein all discordant test sites identify the repeat unit 

number of the target. 

151. The method of claim 1 wherein the number of concordant test sites per locus 
array is one or two for a homogeneous sample in a nonredundant, multiple array redimdant 

10 or serial redvmdant assay, 

1 52, The method of claim 1 wherein the number of concordant test sites is more than 
one for a mixed sample. 

15 153. The method of claim 1 17 wherein the number of concordant test sites is more 

than one for a simultaneous hybridization of differently labeled reporters redundant assay. 

1 54. The method of claim 1 wherein the nature of the hybridization complex may be 
resolved into match, mismatch/gap, and mismatch/overlap. 

20 

155. The method of claim 1 54 wherein the resolution is determined from the signal 
intensity of the hybridization complex assay. 

156. The method of claim 155 wherein the resolution is determined from gradient 
25 application of a selection of electronic, chemical and thermal stringency conditions and 

resulting change in the signal intensity of the hybridization complex assay. 
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157. The method of claim 154 wherein the resolution is determined from the 
knowledge of the probes present in the hybridization complex assay. 

30 
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1 58. The method of claim 1 wherein the number of test sites is less than providing a 
test site for each allele. 



159. The method of claim 1 wherein the target is applied to a reduction of test sites 
5 necessary to identify the nximber of repeat units in the target DNA. 

160. The method of claim 159 wherein the reduction increases the statistical 
significance of results. 

10 161, The method of claim 1 wherein the target material constitutes homozygous allele 



for a locus. 

15 

163. The method of claim 1 wherein the target material constitutes more than one 
allele per locus for a mixed sample. 

164. The method of claim 163 wherein the mixed sample further includes sample 
20 from more than one individual. 

165. The method of claim 164 v^dierein the mixed sample further includes tumor tissue 
mixed with normal tissue. 

25 166, The method of claim 165 wherein the tumor tissue is normal tissue. 



for a locus. 



162. 



The method of claim 1 wherein the target material constitutes heterozygous allele 



167. 



The method of claim 165 wherein the tumor tissue is diseased tissue. 



168. 



The method of claim 1 wherein the target material includes tumor tissue. 



30 



169. 



The method of claim 1 wherein the target material includes plant material. 
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170. The method of claim 1 wherein the target material includes animal material. 

171. The method of claim 170 wherein the animal material is mammal material . 

1 72. The method of claim 1 70 wherein the animal material is human material. 

1 73 . The method of claim 1 70 wherein the target material includes bird material. 

1 74. The method of claim 1 70 wherein the target material includes fish material. 

175. The method of claim 1 wherein the target material constitutes microbial material. 

1 76. The method of claim 1 75 wherein the microbial materials is viral. 

1 77. The method of claim 175 wherein the microbial materials is bacterial, 

178. The method of claim 175 wherein the microbial materials is protozoa. 

179. The method of claim 1 wherein the method is used for identification. 

1 80. The method of claim 1 wherein the method is used for paternity testing. 

181. The method of claim 1 wherein the method is used for forensics. 

182. The method of claim 1 wherein the method is used for disease diagnostics. 

1 83 . The method of claim 1 wherein the method is used for breeding. 

184. The method of claim 164 wherein the mixed sample fiirther including sample 
fi:om one individual which includes monoclonal and polyclonal cell sources. 
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185. The method of claim 106 wherein the label is amplified by enzymatic label 
amplification. 

186. An active, programmable, electronic biological diagnostic instrument adapted 
for the practice of the method of claim 1 . 

1 87. A method for utilizing electronic techniques in determining the nature of repeat 
units in a genetic target, comprising the steps of: 

providing a plurality of hybridization complex assays arrayed on a plurality of 
test sites, where the hybridization complex assay includes at least: 

a nucleic acid target containing repetitive DNA sequence, 

a capture probe having a first imique flanking sequence and n repeat 

units, where n > 0, complementary to the target sequence, and 

a reporter probe having a selected sequence complementary to the same 

target sequence strand, the reporter including attributes selected firom the group 

consisting of: 

a) wherein the selected sequence of the reporter includes a 
second unique flanking sequence and n repeat units, 
where n > 0, 

b) wherein the selected sequence of the reporter includes 
sequence complementary to a microvariant region of the 
target sequence, 

c) wherein the selected sequence of the reporter includes a 
second unique flanking sequence and n repeat units, 
where n > 0 and sequence complementary to a 
microvariant region of the target sequence, 

wherein the sum of repeat units in capture plus reporter is greater 
than zero, 

the sequence of the capture probe differing at at least two test sites. 
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determining concordance and discordance among the hybridization complex 
assays at the test sites as determined at least in part by hybridization stability, and 

determining the nature of the repeat units in the target sequences based upon 
determination of the concordant site and ftuther based upon the knowledge of the probes 
5 located in the hybridization complex at that site. 

188. A method for determining the nature of repeat xmits in a genetic target, 
comprising the steps of: 

provide a platform for the identification of the genetic target, including a set of 
1 0 probes having a first unique flanking sequence, a variable nimiber of intervening repeat 

units and a second unique flanking sequence, 

hybridizing the genetic target with the probes, including electronic hybridization 
stringency conditions during the hybridization, whereby the unique flanking sequence 
is properly indexed with the target, and 
15 determining concordance and discordance at the test sites as determined at least 

m part by electronic hybridization stability, 

whereby a loop out is formed in discordant hybridizations, thereby providing 
energetically less favorable condition than in the concordant hybridization, 

20 189. A method for determining the nature of repeat units in a genetic target, 

comprising the steps of: 

provide a platform for the identification of the genetic target, including a set of 
probes having a first unique flanking sequence, a variable number of intervening repeat 
units and a second imique flanking sequence, 
25 hybridizing the genetic target with the probes, including electronic hybridization 

stringency conditions during the hybridization, whereby the xmique flanking sequence 
is properly indexed with the target, and 

determining concordance and discordance at the test sites, 
whereby a loop out is formed in discordant hybridizations, thereby providing 
30 energetically less favorable condition than in the concordant hybridization. 
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190. A method for detennining the nature of repeat units in a genetic target, 
comprising the steps of: 

provide a platform for the identification of the genetic target, including a set of 
probes having a first unique flanking sequence, a variable nxmiber of intervening repeat 
5 imits and a second xmique flanking sequence, 

hybridizing the genetic target with the probes, and 

detennining concordance and discordance at the test sites as determined at least 
in part by electronic hybridization stability, 

whereby a loop out is formed in discordant hybridizations, thereby providing 
1 0 energetically less favorable condition than in the concordant hybridization. 

191. A method for detennining the nature of repeat units in a genetic target, 
comprising the steps of: 

providing a platform for the identification of the genetic target, including probes 
1 5 selected fi:om the group consisting of: 

1) a probe having a first unique flanking sequence, an intervening repeat 
region and a second unique flanking sequence, and 

2) a sandwich assay comprising a capture probe having a first unique 
flanking sequence and n > 0 repeat units and a reporter probe having n 

20 ' > 0 repeat vmits in sequence with a second unique flanking sequence, 

wherein the sum of the repeat units is >0, 
hybridizing the target with the probes under electronic stringent conditions so as 
to provide proper indexing, and 

detemmiing concordance and discordance at the test sites as determined at least 
25 in part by hybridization stability, 

wherein hybridization stability includes electronic hybridization stability. 

192. A method for determining the nature of repeat imits in a genetic target, 
comprising the steps of: 

30 providing a plxiraiity of hybridization complex assays arrayed on a plxirality of 

test sites, where the hybridization complex assay includes at least: 
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a nucleic acid target containing repetitive DNA sequence, 

a capture probe having a first unique flanking sequence and n repeat 
units, where n > 0, complementary to the target sequence, and 

a reporter probe having a selected sequence complementary to the same 
target sequence strand, the reporter including a second imique flanking sequence, 
and n > 0 repeat units, 

wherein the sum of repeat units in capture plus reporter is greater than 

zero, 

the sequence of the capture probe diflfering at at least two test sites, 
determining concordance and discordance among the hybridization complex 

assays at the test sites as determined at least in part by hybridization stability and 

determining the nature of the repeat imits in the target sequences based upon 

determination of the concordant site and further based upon the knowledge of the probes 

located in the hybridization complex at that site. 
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Overlap 
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FIG. 22A 



Signal for Gap, Match and Overlap 
after 3 minutes of Denaturatiori 




Capture Oligo 

FIG. 22B 
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after 10 minutes of Denaturatlon 
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